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Translation  Invariant  Multiscale  Analysis 

W.  R.  Madych* 


Abstract 

The  notion  of  multiscale  analysis  introduced  by  R.  R.  Coifman  and 
Y.  Meyer  is  considered  and  the  translation  invariant  case  is  character¬ 
ized. 

1  Introduction 

Recall  that  a  dyadic  multiscale  analysis  of  L2(Rn)  is  an  increasing  sequence 
V  =  {Vj  :  j  =  . . . ,  —  1,  0,  1, 2, . . .}  of  closed  subspaces  of  L2(Rn)  which  has 
the  following  properties: 

0)-  Ur-oo  Vi  is  dense  in  L2(Rn)  and  flji-oo  Vj  =  {0}. 

(2) .  f(x)  is  in  \)  if  and  only  if  f(2x)  is  in  Vj+i. 

(3) .  There  is  a  lattice  T  in  Rn  such  that  for  every  /  in  V'0  and  every  7  in  T 

the  function  /7  is  in  V0.  Here  and  in  what  follows  we  use  the  notation 
fi(x)  =  f(x  ~  7)- 

(4) .  There  are  two  positive  constants  C2  >  C 1  >  0  and  a  function  g  in  1 0 

such  that  Vo  is  the  closed  linear  span  of  <7-,,  7  6  T,  and 
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An  introduction  to  the  subject  may  be  found  in  [1,2].  A  basic  property 
of  a  multiscale  analysis  V  is  the  following: 

(5).  There  is  a  function  <j)  in  F0  such  that  the  collection  {</>-, }->er  is  an  or' 
thonormal  basis  in  Vr0. 

This  fact  may  be  regarded  as  a  substitute  for  (4)  and  plays  an  important  role 
in  what  follows. 

A  dyadic  multiscale  analysis  is  translation  invariant  if  all  the  translates 
of  /,  {fy  :  y  €  /?"},  are  in  F0  whenever  /  is  in  F0. 

The  canonical  example  of  a  translation  invariant  multiscale  analysis  of 
L2(R )  is  when  F0  is  the  collection  of  those  functions  in  L2(R)  whose  Fourier 
transforms  are  supported  in  the  interval  [— 7T,  7r].  A  natural  choice  of  <j>  in 
this  case  is  given  by 

sin  ttx 
nx 

The  point  of  this  paper  is  to  give  a  characterization  of  translation  in¬ 
variant  multiscale  analyses.  For  the  sake  of  clarity  in  what  follows  we  will 
restrict  our  attention  to  the  case  n  =  1  and  T  —  Z,  the  lattice  of  integers. 
The  statements  and  arguments  in  the  general  case  are  completely  analogous 
to  this  basic  case. 

We  now  briefly  digress  to  list  some  of  the  conventions  which  are  used 
here:  The  Fourier  transform  /  of  a  function  /  is  defined  by 


whenever  it  makes  sense  and  distributionally  otherwise.  Basic  facts  con¬ 
cerning  Fourier  transforms  and  distributions  will  be  used  without  further 
elaboration  in  what  follows.  To  avoid  the  pedantic  repetition  of  “almost 
everywhere”  and  othe  modifying  phrases  which  are  inevitably  necessary 
when  dealing  with  functions  defined  almost  everywhere,  all  equalities  be¬ 
tween  functions  and  other  related  notions  are  interpreted  in  the  distribu¬ 
tional  sense  whenever  possible.  The  term  support  is  also  used  in  the  dis¬ 
tributional  sense;  in  particular  the  support  of  a  function  /  in  L2(R)  is  a 
well  defined  closed  set.  If  IF  is  a  collection  of  tempered  distributions  then 
W  is  the  collection  of  Fourier  transforms  of  elements  of  VF,  in  other  words 
IF  =  {/  :  /  =  g  for  some  g  in  VF}.  For  a  subset  Q  of  R  and  a  real  number 
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r  the  sets  rQ  and  f l  +  r  are  defined  by  rfi  =  {x  :  i  =  rw  for  some  w  in  fi} 
and  Q  +  r  =  {x  :  x  =  jj  +  r  for  some  u>  in  Q};  L2(Q)  is  the  L 2  closure  of  the 
subspace  of  those  functions  in  L2(R)  whose  support  is  contained  in  Q.  For 
notational  simplicity  we  use  Q  to  denote  the  closed  interval  [—  ir,  7r], 

We  can  now  conveniently  state  our  main  observation. 

Theorem  Suppose  V  is  a  translation  invariant  dyadic  multiscale  analysis 
of  L2(R).  Then  Vo  =  L2(tt)  where  Q  is  a  closed  subset  of  R  which  has  the 
following  properties: 

(a) ,  fi  C  2Q. 

(b) .  {Q  +  27r,}n{ft  +  2? rk}  is  a  set  of  Lebesgue  measure  0  for  any  pair  of 

integers  such  that  j  /  k. 

(c) .  ur.-„{n  +  2rf}  =  «. 

(i).  U?=,  i2(2*S!)  is  dense  m  L\R). 

Conversely,  ifVk,kdZis  defined  by  1  *  =  L2( 2kQ)  where  Q  is  a  closed  subset 
of  R  which  satisfies  the  properties  above  then  the  sequence  of  subspaces  {V  *} 
is  a  translation  invariant  multiscale  analysis  of  L2(R). 

Remark  1  In  view  of  the  example  given  above  it  is  very  tempting  to  con¬ 
jecture  that  the  set  fi  in  the  Theorem  must  be  of  the  form  U  =  Q  +  a  for 
some  real  number  a  which  satisfies  7T  <  a  <  n.  Certainly  such  Q’s  satisfy 
the  desired  conditions.  However  the  conditions  of  the  Theorem  are  satisfied 
by  fi’s  which  need  not  be  connected  as  the  following  example  due  to  Rudi 
Lorentz  shows. 


Remark  2  Consider 

Qa  =  [-1,1]U[2jt+  1,47 r-  1] 
«*  =  [-5,5] 

Qc  =  [-1, 1] 

Qd  =  [0,  27t]  . 
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It  is  not  difficult  to  verify  that  each  fiQ  listed  above  is  a  closed  set  which 
fails  to  satisfy  condition  (o)  but  satisfies  the  remaining  conditions  in  the 
Theorem.  These  examples  show  that  conditions  (a)-(d)  are  not  redundant. 

Remark  3  Note  that  condition  (a)  implies  that  0  is  contained  in  fi.  In 
addition  to  this  it  is  clear  that  if  fi  contains  a  neighborhood  of  the  origin 
then  it  satisfies  condition  (d).  In  view  of  this  it  seems  reasonable  to  suspect 
that  subsets  fi  which  satisfy  the  conditions  of  the  Theorem  must  contain  an 
open  neighborhood  of  the  origin.  That  this  is  not  the  case  can  be  seen  by 
considering  the  following  example  of  fi: 

oo  oo 

{  U  [-(2  -  2~k)2~kir,  -2~kx]}  [J[0,  v]{  |J  [(2  -  (2  -  2^)2^)^  (2  -  2~k)n]}  . 

k=i  k= i 

Remark  4  In  view  of  the  examples  listed  above  it  may  be  of  some  interest 
to  obtain  a  significantly  more  lucid  description  of  the  set  fi  than  that  given 
in  the  Theorem. 

A  corollary  concerning  wavelets  generated  by  V  is  recorded  at  the  end  of 
Section  2. 


2  Details 


We  begin  by  establishing  a  basic  lemma.  First  recall  that  the  indicator 
function  of  a  set  fi  is  usually  denoted  by  xn  and  satisfies 


1  if  f  €  n 

0  otherwise. 


Lemma  Suppose  V  is  a  translation  invariant  dyadic  multiscale  analysis  of 
L2(R)  and  <p  is  a  function  whose  existence  is  guaranteed  by  (5).  Then 


\<t>\  =  Xn 


where  ls  the  indicator  function  of  a  closed  set  fi  which  has  properties 
(a)-(d)  in  the  statement  of  the  Theorem. 

Let  Q  be  the  support  of  <f>.  To  prove  the  lemma  we  will  first  show  that  fi 
satisfies  property  (b). 
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Recall  that  (5)  implies  that  for  all  /  in  V'0  we  may  write 


(6)  m  =  s(om 

where  g  is  2tt  periodic  and  square  integrable  over  Q.  In  particular,  since  Vo 
is  translation  invariant,  <j)y  is  in  V’o  so  setting  a  =  —y  we  may  write 

eia*m  = 


for  some  such  g.  Hence 

_  Oftm)  =  g(£  —  2? rm)<^(£  —  2irm)  =  —  2 nm) 

which  implies  that 

e.aU-2»m)  _ 

on  fi  +  2irm.  For  two  different  values  of  m  the  last  equality  implies  that 

eia((-2vj)  _  eia(Z-2-nk) 


on  {fi  +  2nj}  f|{Fi  +  2xk}.  Re-expressing  the  last  relation  as 

eia(t-2*k)^ei2ira(k-j)  _  Q 


it  is  clear  that  either  a  is  an  integer,  j  is  equal  to  k,  or  {0  +  2^}  fl{^  +  2Txk) 
is  a  set  of  measure  zero.  Since  a  may  be  any  real  number  we  conclude  that 
{Q  +  2irj}  +  2ftk}  has  measure  zero  whenever  j  ^  k. 

Now,  since  <?!>*(£)  =  e_,fe^(£),  k  £  Z,  are  orthonormal,  we  may  write 

(7)  /  4>k{x)Mx)dx=  I  e'mim)\2dt  = 

Jr  Jr 


(  e‘mf  Y  |^(£~  27r;)|2di  = 


where  m  =  l  —  k.  The  last  equality  implies  that 


1  when  m  =  0 
0  otherwise 


Y  !<£(£  ~27r»l2 

jez 


_1_ 

2w 
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on  R  and  since  {S7  -f  27 rj)  f]{ft  +  2t <k]  has  measure  zero  whenever  j  ^  k  we 
may  conclude  that 

M?)i  = 

and 

U  {Q  +  2irk}  =  R. 
kez 

To  see  (a)  observe  that  (2)  and  the  facts  demonstrated  above  imply  that 

Xn(0  =  h{QXn(Z/2) 

where  h  is  47r  periodic  and  square  integrable  over  2 Q.  Since  X2n(0  = 
Xn(£/2),  the  last  equality  involving  h  implies  that  xn  vanishes  whenever 
X2n  does  so  Q  C  2Q. 

Finally,  the  fact  that  U*Li  L7(2kQ)  is  dense  in  L7{R)  is  an  immediate 
consequence  of  property  (1).  The  proof  of  the  Lemma  is  complete. 

Now,  suppose  <j>  and  Q  are  as  in  the  Lemma  and  its  proof.  Since  V'0 
consists  of  functions  /  which  satisfy  (6)  it  is  clear  that  1'0  is  contained  in 

To  see  that  V0  =  Z,2(Q)  let  /  be  any  element  in  L2(Q)  and  let  h  be  defined 

by  _ 

*(£)  =  /  <?(sc)/i^)i 

\  0  otherwise. 

By  virtue  of  the  properties  of  Q  established  above  it  is  clear  that 

y=*n(«=  i(0 

and 

ho  =  ho  ho 

where 

9(0  =  &  (nhz-  27rj)  ]  ( 12  -  2itj) 

\j€Z  j  \jtz 

Thus  /  satisfies  (6)  and  hence  we  may  conclude  that  L2(n)  is  contained  in 
Vo- 
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The  Lemma  together  with  the  last  observation  imply  the  first  assertion 
of  the  Theorem. 

To  see  the  converse,  let  V  be  the  sequence  of  subspaces  {!*},  k  €  Z , 
defined  by  \\  =  L2( 2kQ)  where  Q  is  a  closed  set  which  satisfies  properties 
(a)-(d)  of  the  Theorem. 

The  fact  that  V  is  translation  invariant  and,  in  particular,  satisfies  prop¬ 
erty  (3)  with  F  =  Z  is  an  immediate  consequence  of  the  definition.  Property 
(2)  is  also  immediate.  That  V  is  an  increasing  sequence  of  subspaces  and 
Ufc<=z  I  *  is  dense  in  L2(R)  are  consequences  of  properties  (a)  and  (d). 

That  D*ez  V*  =  {0}  follows  from  the  fact  that  the  measure  of  Q  is  finite. 
Indeed,  its  measure  is  2r  which  can  be  seen  from 

I  Xn m  =  £  /  .  Xn(f)df  =  /  £  Xn(t  ~  2*j)4  =  ^ 

Jr  'q  jez 

by  using  properties  (b)  and  (c). 

Finally,  to  see  property  (5)  take 

and  use  properties  (b)  and  (c)  to  write  (7)  which  shows  that  A:  €  Z, 

are  orthonormal  and,  for  /  in  Vr0, 

m  =  vs  rg  /<«  -  2  Kk)\  m 

or 

/(*)  =  £  /OX1  ~  k) 

kez 

which  shows  that  they  are  complete  in  Vq. 

This  completes  the  proof  of  the  Theorem. 

Remark  5  Suppose  V  is  a  translation  invariant  multiscale  analysis  and  Q  is 
a  closed  set  such  that  V'0  =  L2(Q).  Then  if  H'o  is  the  orthogonal  complement 
of  r0  in  Vi,  IV0  =  /-2(T)  where  T  =  2f2  \  17.  Let  rp  be  such  that  the  set 
{xpi,  :  k  €  Z}  is  an  orthonormal  basis  for  Ur0.  Such  a  ip  may  be  referred  to 
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as  a  wavelet.  Using  reasoning  analogous  to  the  proof  of  the  Theorem,  it  is 
clear  that  t-’  is  a  wavelet  if  and  only  if 

M  =  ■ 

Now,  an  analyzing  wavelet  in  the  sense  of  Meyer,  [l],  is  globally  integrable 
and  hence  its  Fourier  Transform  must  be  continuous.  Clearly  ip  is  not  such 
an  analyzing  wavelet. 

Corollary  .4  translation  invariant  multiscale  analysis  cannot  give  rise  to 
analyzing  wavelets  in  the  sense  of  Meyer. 

References 
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Polyharmonic  Splines,  Multiscale  Analysis, 
and  Entire  Functions 

W.  R.  Madych* 


Abstract 

Two  applications  of  cardinal  polyharmonic  splines  are  considered. 

First,  the  notion  of  multiscale  analysis  in  the  sense  of  R.  Coifman 
and  Y.  Meyer  is  defined  and  it  is  shown  that  certain  subclasses  of 
polyharmonic  splines  generate  such  analyses.  Next,  it  is  shown  that 
polyharmonic  splines  can  be  used  in  a  summabilitv  method  for  the 
recovery  of  a  large  class  of  entire  functions  of  exponential  type  from 
samples  taken  on  the  integer  lattice. 

1  Introduction 

If  A:  is  a  positive  integer  an  n- variate  k-harmonic  cardinal  spline  is  a  tempered 
distribution  /  on  Rn  such  that  Afc/  is  a  measure  supported  on  the  integer 
lattice  Zn  in  Rn.  Symbolically 

(1)  &hf(x)  =  Y,  a'Ax  ~  j) 

jezn 

where  A  is  the  n-  variate  Laplacian,  A*  =  A  A*-1  for  k  >  1,  and  6(x)  is  the 
unit  Dirac  measure  supported  at  the  origin.  The  class  of  ^-harmonic  cardinal 
splines  on  Rn  is  denoted  by  SHk(Rn).  A  polyharmonic  spline  is  one  which  is 
A:-harmonic  for  some  k. 

The  basic  properties  of  these  splines  are  recorded  in  [12,  13].  Here  we  are 
primarily  concerned  with  certain  applications.  More  specifically,  in  Section  2 
we  show  that  certain  subclasses  of  the  spaces  SHk{Rn )  provide  examples  of 
multiscale  analyses  whose  general  theory  was  developed  recently  by  Y.  Meyer 
and  his  collaborators.  In  Section  3  we  show  that  polyharmonic  splines  provide 
a  summability  method  for  the  recovery  of  entire  functions  of  exponential  type 
from  samples  on  the  lattice  Zn.  The  introductions  to  these  sections  contain 
a  more  complete  description  of  their  contents.  Various  comments  which  are 

"Department  of  Mathematics,  University  of  Connecticut,  Storrs,  CT  06268.  Partially 
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not  particularly  germane  to  the  development  but  are  pertinent  to  the  subject 
matter  are  given  in  Section  4. 

The  notation  used  here  is  standard,  if  necessary  see  [13]  for  a  more  de¬ 
tailed  explanation.  Here  we  merely  remind  the  reader  that  there  are  several 
common  normalizations  for  the  Fourier  transform.  In  this  note  we  use 

tp(£)  =  (2 Tv)~nt7  f  ip{x)e~x^,x't  dx 

JRn 

for  the  Fourier  transform  ip  of  a  test  function  ip.  Also,  for  convenience,  we  use 
the  notation  T(f)/(£)  for  the  product  of  a  distribution  T  and  a  function  / 
even  when  such  an  expression  makes  no  sense  pointwise;  however,  its  meaning 
in  the  distributional  sense  should  be  clear  from  the  context. 

2  Multiscale  analysis 

Recall  that  a  dyadic  multiscale  analysis  of  L2(Rn)  is  an  increasing  sequence 
V  =  { Vj  :  j  =  . . . ,  —  1, 0, 1,  2, . . .}  of  closed  subspaces  of  L2(Rn )  which  has 
the  following  properties: 

0).  Ujl-oo  Vi  is  dense  in  L2{Rn)  and  fljUoo  Vi  ~  {°}- 

(ii) .  f(x)  is  in  Vj  if  and  only  if  /( 2x)  is  in  V j+1. 

(iii) .  There  is  a  lattice  T  in  Rn  such  that  for  every  /  in  Vr0  and  every  7  in  T 

the  function  /-,  is  in  T0.  Here  and  ir.  what  follows  we  use  the  notation 

fi{x)  =  /(*  ~  T)- 

(iv) .  There  are  two  positive  constants  >  Ci  >  0  and  a  function  g  in  F0 

such  that  V’o  is  the  closed  linear  span  of  g1,  7  6  T,  and 


An  introduction  to  the  subject  may  be  found  in  [5,  6,  9].  A  basic  property 
of  a  multiscale  analysis  V  is  the  following: 

(v).  There  is  a  function  <p  in  lo  such  that  the  collection  {^7}7gr  is  an  or¬ 
thonormal  basis  in  Vo- 

This  fact  may  be  regarded  as  a  substitute  for  (iv)  and  plays  an  important 
role  in  what  follows. 

Finally,  we  say  that  a  dyadic  multiscale  analysis  is  composed  of  generalized 
spline  functions  in  the  sense  of  Y.  Meyer  if  all  the  elements  of  the  subspace 
Vo  are  continuous  and  the  mapping  which  maps  /  into  the  sequence  of  values 
{/(7)h€r  is  an  isomorphism  from  Vr0  onto  ^2(T). 
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In  what  follows  we  will  restrict  our  attention  to  the  case  T  =  2",  the 
lattice  of  integers.  The  statements  and  arguments  in  the  general  case  are 
completely  analogous  to  this  basic  case.  Note  that,  by  virtue  of  item  (ii)  in 
the  above  definition,  a  multiscale  analysis  is  determined  by  the  subspace  V'0. 
For  any  positive  integer  k  consider  the  subspace 

V0(k)  =  SHk(Rn)f]L7(Rn)  . 

For  any  integer  m  define 

Vm(k)  =  {f  :  /( 2~mx)  isinVo(A)}- 

In  other  words,  /  is  in  V0(k)  if  and  only  if  /( 2mx)  is  in  Vm(k).  The  questions 
we  address  in  this  section  are  the  following:  For  what  values  of  k,  if  any,  is 
th_  increasing  family 


(2)  V(k)  =  {Vm(k)  m=..., -1,0, 1,2,...} 

a  dyadic  multiscale  analysis  of  L2(Rn)  and  when  is  it  composed  of  spline 
functions  in  the  sense  of  Y.  Meyer. 

The  answers  to  the  above  questions  are  not  difficult.  We  begin  by  recalling 
several  facts  concerning  SHk(Rn). 

If  /  is  in  SHk(Rn )  then  for  x  in  a  sufficiently  small  neighborhood  of  j 


(3)  f{x)  =  a:Ek{x  -  j)  +  h(x) 

where  Ek  is  the  fundamental  solution  of  A*,  namely, 


(4) 


f  c(n,  fc)|x|2fc  n  if  n  is  odd 

|  c(n,  A:)|x)2fc~”  log  |x|  if  n  is  even  , 


here  the  constant  c(n,  k )  depends  only  on  n  and  k  and  is  chosen  so  that 
A kEk(x)  =  6(x);  h  is  infinitely  differentiable  and  aj  is  the  constant  in  rep¬ 
resentation  (1)  of  A */•  Moreover  if  A kf  =  0  then  /  is  a  ^-harmonic  poly¬ 
nomial.  These  observations  together  with  the  behavior  of  Ek  at  the  origin 
allow  us  to  conclude  the  following: 

Proposition  1  If  4k  <  n  then  \  '0(k)  —  {0}. 


If  4k  >  n  then  the  function  <Pk  defined  by  the  formula  for  its  Fourier 
transform 

ier2* 


(5) 


4>fc(0  =  (27T)-"/2 


(EjeZn  1C  "  27rj|-“) 


1/2 


is  well  defined  and  is  in  Note  that,  by  virtue  of  the  same  reasoning 

as  used  in  [13],  $k  can  be  extended  to  a  function  which  is  holomorphic  in 
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a  tube  containing  Rn  in  complex  rc-space.  As  a  consequence  <I>A  enjoys  the 
representation 

$*(*)  =  E  aiE*(x  -  j) 

j  ezn 

where  the  sequence  {aj}  decays  exponentially,  namely, 

|aj|  <  Ce-f|j> 

and  where  C  and  e  are  positive  constants  which  depend  only  on  k  and  n. 
Observe  that 

L  *‘(l  ~  j)4,‘("  ~  m)dx  =  (2,r)"”  L 

=  (27r)-n  /  e'<m_j’^d^ 

Jq n 

where 

Qn  =  {C  =  (Ci,  •  •  • .  €n)  :  -*■  <  <  7T,  j  =  1, . . . ,  n} 

and  hence  the  collection 

(6)  {«*(*- j)  :  j€Z"} 

is  an  orthonormal  set  in  \'o(k).  Thus  V0(k)  certainly  contains  much  more 
than  the  0  element  in  this  case.  As  it  turns  out,  the  collection  (6)  is  also 
complete  in  \’o{k). 

To  see  this,  let  /  be  any  element  in  1  o(&).  Viewing  /  as  a  distribution  on 
So(Rn )  =  {<f>  €  S(Rn)  :  Du<j>{Q)  =  0  for  all  multi-indices  j/}  , 
it  is  clear  that 

(7)  m  =  /’(flier* 

where  P  is  locally  square  integrable  and  periodic,  that  is,  P(£  —  27rj)  =  P(f) 
for  all  j  in  Zn.  Here  S(Rn)  is  the  Schwartz  space  of  rapidly  decreasing 
functions  on  Rn.  It  follows  from  (7),  the  periodicity  of  P,  and  the  fact  that 
/  is  square  integrable  that 


Q{  A  =  (2  tT/2p(0 


(e  ' 

\j€Z"  J 


is  locally  square  integrable.  Hence  we  may  write 


/(e)  =  <?(e)<Mfl 
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or 

}{x)  =  52  aj<Mz  -  j) 

j  €Z" 

where 

Qtt)  =  52  aje_,<J'f>  and  52  laj|2  <  00  • 

j€Zn  jezn 

As  a  result  we  may  conclude  that  (6)  is  a  complete  orthonormal  set  in  V'0 
and  the  family  V(k)  satisfies  property  (v). 

The  fact  that  the  family  V(A:)  enjoys  properties  (it)  and  (in)  is  an  imme¬ 
diate  consequence  of  the  definition.  Hence,  to  conclude  that  V(k)  is  a  dyadic 
multiscale  analysis,  it  remains  to  show  that  it  satisfies  property  (i). 

First,  to  see  that  flm=-oo  F'm(A:)  =  {0}  observe  that  if  /  is  in  0m=-oo  Vm(k) 
then  /  must  be  a  ^-harmonic  polynomial  in  L2(Rn)  and  consequently  /  =  0. 

To  conclude  that  Um=-oo  is  dense  in  L2(Rn)  we  argue  as  follows: 

Let  /  be  any  element  in  L2(Rn)  such  that  /  is  bounded  and  has  compact 
support.  Define  sm  to  be  the  ^-harmonic  spline  whose  Fourier  transform  is 

««)  =  Cm«)|f|-2‘ 

where  Pm  is  the  2m+17r  periodization  of  |^|2fc/(^),  namely, 

Pm(Z)  =  52  If  -  2m+1*tfkm-  2m+Vj)  . 

j€Z" 

Clearly  sm  is  in  V'm(A:).  Now  for  m  sufficiently  large  w>e  may  write 

11/  -  =  JR„  1/(0  - 

=  /  ii„(oi  *jh<cL  ir“« 

0|{|>2m7r 

where  C  is  a  constant  independent  of  m.  Finally,  since  the  last  integral  in 
the  above  string  clearly  goes  to  0  as  m  goes  to  infinity  and  the  class  of  such 
/’s  is  dense  in  L2(Rn),  we  may  conclude  that  the  desired  result  holds. 

We  summarize  these  observations  as  follows: 

Proposition  2  If  4k  >  n  then  the  increasing  sequence  of  subspaces  V(k) 
defined  by  (2)  is  a  dyadic  multiscale  analysis  of  L2(Rn).  Furthermore,  if  $k 
is  defined  by  (5)  then  the  collection  (6)  is  a  complete  orthonormal  basis  for 
Vo  (*)• 

Note  that  if  2 k  <  n  then  the  non-zero  elements  of  Vo{k)  are  not  locally 
bounded.  On  the  other  hand  if  2A:  >  n  then  the  elements  /  of  Vo[k)  are 
continuous  and  enjoy  the  representation 

(8)  /(*)  =  £  /( j)i*<*  -  j) 

j€Z“ 
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where  Lk  is  defined  by  the  formula  for  its  Fourier  transform 


(9) 


UZ)  =  (2  tt)-"'2 


Ifl 


-2k 


Ej €2"  If  -  lirjl 


—2k 


and  the  series  converges  uniformly  for  x  on  compact  subsets  of  Rn.  For  each 
such  /  we  may  write 

!>(«.)  =  /R„ 


(Ej£2-  |{-2xj-»)' 


where 


(2?r )n/2p/(f)  =  £  /(j)e  ,(j’°  . 

j€2" 

Since  there  are  constants  0  <  Cj  <  C 2,  independent  of  f  such  that 


Ci  < 


Ei£z-  |g  -  2tJ 


-ifc 


<  C, 


(E*z-  I?  -  2x|-»)‘ 

we  may  conclude  from  the  last  formula  for  ||/|[i2(fir.)  that 


o  £  l/(j)f  <  ll/llli,*.,  <  C,  £  i/(j)|J 

jezn  j€Zn 

. _  I  *  ir  »  \  101 

whenever  ,/  is  111  anu  zk  ^  /t. 

In  other  words  we  have  the  following: 

Proposition  3  If  2k  >  n  then  the  increasing  sequence  of  subspaces  V(£) 
defined  by  (2)  is  a  dyadic  multiscale  analysis  of  L2(Rn )  which  is  composed 
of  spline  functions  in  the  sense  of  Meyer.  Furthermore,  in  this  case  the 
complete  orthonormal  basis  forV(k)  defined  by  (5)  and  (6)  enjoys  exponential 
decay,  namely, 

|«M*)I  <  Ce-'W 

and  where  C  and  e  are  positive  constants  which  depend  only  on  k  and  n. 


3  Entire  functions  of  exponential  type 

Suppose  /  is  a  tempered  distribution  on  Rn  whose  Fourier  transform  /  is 
supported  in 

Qn  =  {f  =  (ft,  •  •  •  1  fn)  :  ->r  <  f j<rr,  j  =  l,...,n}  . 
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It  is  a  consequence  of  a  well  known  generalization  of  a  Paley-Wiener  theorem 
that  such  an  /  is  analytic,  is  of  polynomial  growth  on  Rn,  and  its  holomorphic 
extension  to  complex  n-space  is  an  entire  function  of  exponential  type.  The 
class  of  all  such  /’ s  will  be  denoted  by  £',r(./2"). 

In  particular,  given  such  an  /  the  sequence  of  values 

(10)  {/(j) 

is  well  defined.  If  ,  in  addition,  /  is  in  L2(Rn)  then  it  is  well  known  that  / 
can  be  recovered  from  these  values  via  the  formula 

(11)  /(*)  =  H  /(j)  smc(x  ~  j) 

j€ZB 

where 

sin(7TXj) 
mj 

Here  the  series  converges  uniformly  and  absolutely  on  compact  subsets  of 
Rn;  it  also  converges  in  L2(Rn).  Similar  results  hold  if  the  sequence  of  values 
(10)  is  in  lp(Zn)  for  some  p,  1  <  p  <  oo.  Unfortunately,  if  the  sequence  of 
values  (10)  fails  to  be  in  ip{Zn)  for  some  p,  1  <  p  <  oo,  the  series  (11)  may 
not  make  sense.  On  the  other  hand,  since  the  sequence  (10)  is  of  polynomial 
growth  the  fc-harmonic  spline  interpolant 

(12)  &/(*)=  £  /(j)U(x-j) 

jezn 

is  well  defined  for  all  /  in  EK(Rn)  and  all  k  such  that  2k  >  n.  In  view  of  the 
fact  that 

lim  Lk(x )  =  sinc(r) 

k-+  oo 

it  is  natural  to  use  (12)  to  approximate  /  with  the  expectation  that 

(13)  lim  Skf(x)=f(x). 

K—*00 

Such  a  development  is  reminiscent  of  classical  summability  theory  for  Fourier 
series.  In  what  follows  we  will  show  that  (13)  holds  for  a  relatively  wide  class 
of  /’s  in  Er(Rn). 

To  set  up  our  result  we  will  need  several  technical  definitions.  First,  recall 
<.hat  if  T  is  a  distribution  with  compact  support  then  the  periodization  of  T, 
pT,  can  be  defined  by 

(Pr,<»  =  (7><») 

where  <j>  is  any  test  function  in  the  Schwartz  class  S(Rn), 

=  £  <£(£  -  27ri) » 

j€Z" 


n 

sinc(z)  =  sinc(ij, . . . ,  xn)  =  [] 
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and  (T,<f>)  denotes  the  distribution  T  acting  on  the  test  function  6.  Note 
that  pT  is  a  well  defined  periodic  tempered  distribution. 

Observe  that  if  /  is  in  ET(Rn)  then 

(o)  STno  =  (2*)”'V/(e)i*(  o 

where  the  multiplication  on  the  right  hand  side  makes  sense  because  Lk  is 
smooth. 

Define  the  functions  A  and  x  by  the  formulas 
(15)  (2»)-'!x({)  =  A(f)  =  lim  £,K) . 


Note  that  x(f)  is  1  if  f  is  in  the  interior  of  Qn  and  0  if  f  is  in  Rn  \  Qn.  If  f  is 
in  the  boundary  of  Qn  then  x(f)  =  2_m  if  exactly  m  components  of  f  have 
modulus  7r. 

Finally,  recall  that  T  is  said  to  be  a  distribution  of  order  0  in  an  open  set 

n  if 


for  all  <f>  in  C£°(Q)  where  C  is  a  constant  independent  of  <f>.  We  remind 
the  reader  that  given  such  a  T,  by  virtue  of  Riesz  representation,  there  is  a 
bounded  Borel  measure  p  such  that 


{t,i >>=  /  <komo- 

Jn 

Furthermore,  if  fij  is  an  open  set  whose  closure  is  in  D  and  g  is  any  bounded 
function  which  is  infinitely  differentiable  on  Rn\Qi  then  the  product  g(£)T(l;) 
is  well  defined  as  a  distribution  on  Rn.  Namely,  for  any  test  function  xp  this 
product  is  defined  by 


(gT,  xp)  =  ( gT ,  xp<p)  +  { gT ,  tf'(l  -  cf>)) 

J  n 

where  <p  is  a  function  in  C£°(Q)  such  that  <p(£)  =  1  for  f  in  Note  that  this 
definition  is  independent  of  the  particular  choice  of  <f>  and  that  the  resulting 
product  is  tempered  whenever  T  is  tempered. 

Proposition  4  Suppose  f  is  in  E7r(Rn),  f  is  a  distribution  of  order  0  in 
some  neighborhood  of  the  boundary  of  Qn,  and  x(0 p/(0  =  /(f)  •  Then 

Jim  Skf{x)  =  f{x) 

K— »  OO 

uniformly  on  compact  subsets  of  Rn. 
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Proof  Choose  e,  0  <  t  <  tt,  such  that  /  is  a  distribution  of  order  zero  in 
0  =  {£  :  distance  from  £  to  the  boundary  of  Qn  is  <  c}  . 

Set 

Qi  =  {£  :  distance  from  £  to  the  boundary  of  Qn  is  <  e/2}  . 

Let  fa  be  a  function  in  C£°(Q)  such  that  <^i(C)  =  1  for  f  in  Qi  and  let  fa  and 
<f>2  be  infinitely  differentiable  functions  such  that  <£o(£)  +  ^i(f)  +  ^2(0  =  1  for 
all  f  in  Rn,  fa  has  support  in  Qn\Q i,  and  fa  has  support  in  Rn  \  ( Qn  (J^i)- 
Now,  if  ex  denotes  the  exponential  ex(£)  =  e’*x’^  we  may  write 

(16)  Skf(x)  -  /( x)  =  (Xfcp/,  ex)  -  (2tt )~n/2(/,  ex) 

=  <(£*- A)p/,  ex)  =  £  ((Lk-\)pf,exfa). 

i =o,t,2 

Consider  each  term  which  comprises  the  sum  in  the  last  expression. 

First 

|((£*  ~  A)p/,  e*<f>o)\  =  | (/,  (Lk  ~  \)exfa)\ 

<  c  J2  sup  l^((i*(0  ~ 

\v\<NitQn\ni 

since  /  is  a  distribution  of  order  no  greater  than  N.  Because  for  each  v 
\Dv(Lk(£)  —  A(£))|  goes  to  zero  as  k  —>  oo  uniformly  for  f  in  Qn  \  Qx  and 
\Dvex(^)\  <  C„|i|H  it  follows  from  the  last  equality/inequality  string  that 

(17)  lim  \((Lk  -  A )pf,  exfa)\  =  0 

k— *oo 

uniformly  for  x  in  bounded  subsets  of  Rn. 

Next,  there  is  a  bounded  Borel  measure  fi  such  that 

((U  -  A)p /,  «,*,)  =  [  (ik( ()  -  MOMOMQM 0 

Jn 

and,  by  virtue  of  the  bounded  convergence  theorem,  we  may  conclude  that 

(18)  lim  \((Lk  -  A)p/,  e.xfa)\  =  0 

k—k  oo 

uniformly  in  x. 

Finally, 

\({Lk  -  A )p/,  exfa)\  =  | </,  p[{Lk  -  X)exfa])\ 

<CIE  sup|/r((ifc(0-A(0)e,(0M€)l 

j€2nH<^Vf€<2j 
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where  QJ  =  {£  :  f  —  2"j  is  in  Q”}.  Note  that  the  term  corresponding  to 
j  =  0  is  0  because  of  the  factor  <f> i  and  that  the  series  converges  whenever 
2k  >  n.  Estimating  the  size  of  each  term  in  the  last  sum  following  essentially 
the  same  reasoning  as  used  in  the  case  of  the  term  involving  <f>o  allows  us  to 
conclude  that 

(19)  lim  |((Lfe  -  \)pf,  exfc)\  =  0 

K—*00 

uniformly  for  x  in  bounded  subsets  of  Rn. 

Formulas  (17),  (18),  (19),  and  (16)  imply  the  desired  result.  ■ 

Corollary  1  Suppose  f  is  in  E„(Rn),  f  is  a  distribution  of  order  0  in  some 
neighborhood  of  the  boundary  of  Qn .  Then 

lim  Skf(x)  =  f(x)  +  g(x) 

k—*oo 

uniformly  on  compact  subsets  of  Rn  where 

ho = *(fW(f)  -  m 

is  a  distribution  with  support  in  the  boundary  of  Qn 

Examples  show  that  the  condition  that  /  be  a  measure  in  a  neighborhood 
of  the  boundary  of  Qn  is  not  necessary.  However  other  examples  show  that 
without  some  restriction  on  the  behavior  of  /  near  the  boundary  of  Qn  the 
sequence  Skf  may  fail  to  converge. 

Many  subclasses  of  distributions  in  E„(Rn)  satisfy  the  hypothesis  of 
Proposition  4.  Consider  the  foilow'ing  transparent  examples: 

•  /  is  in  Lp(Rn)  for  some  p,  1  <  p  <  2. 

•  Aa  f  is  in  Lp{Rn )  for  some  p,  1  <  p  <  2  and  some  positive  number  a. 
Here  A “/(£)  =  |£|Q/(£)-  Tliis  includes  various  classes  of  distributions 
which  themselves  may  not  be  in  Lp(Rn )  but  whose  derivatives  are. 

Such  examples  include  many  of  the  classes  considered  by  Schoenberg  in  the 
univariate  case. 

4  Remarks 

Various  forms  and  features  of  multiscale  analysis  have  been  known  and  used 
in  harmonic  analysis  for  some  time.  We  cannot  go  into  details  here.  The 
formal  definitions  used  here  were  adopted  from  [9],  The  current  popularity 
of  multiscale  analysis  is  due  to  primarily  to  the  efforts  of  R.  R.  Coifman  and 
Y.  Meyer  and  their  collaborators.  An  introduction  to  the  subject  together 
with  further  developments  and  more  references  may  be  found  in  [5,  6,  9]. 
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Orthogonal  bases  composed  of  spline  functions  (piecewise  polynomials) 
in  the  univariate  case  have  been  considered  quite  early,  for  example  see  [4,  7]. 
However  the  introduction  of  such  bases  which  are  translates  and  dilates  of 
one  function  (wavelet)  is  more  recent;  it  can  be  found  in  [21]. 

In  the  case  n  —  2  the  wavelets  corresponding  to  V(k)  can  be  easily  con¬ 
structed  using  the  recipe  given  in  [9].  Their  construction  in  the  general  case 
is  not  so  clear. 

The  distributional  variant  of  the  Paley-Wiener  Theorem  alluded  to  in  the 
introduction  to  Section  3  can  be  found  in  [8].  For  basic  facts  concerning 
sampling  and  the  n- variate  sine  series  see  [16]. 

At  this  workshop  Sherman  Riemenschneider  pointed  out  that  certain  vari¬ 
ants  of  Proposition  4  were  known  in  the  univariate  case,  see  [17].  When  /  is 
in  E.K(Rn)  f|  L2(Rn)  then  (12)  also  holds  in  the  L2(Rn)  sense.  Similar  results 
hold  for  other  subclasses  of  Er{Rn).  For  examples  in  the  univariate  case  see 
[15,  19];  we  are  currently  preparing  various  analogues  in  the  n- variate  case. 

The  material  in  [1,  2,  3,  9,  10,  11,  18]  indicates  that  box  splines  also 
generate  interesting  examples  of  multiscale  analyses.  However,  summability 
theory  for  regularly  sampled  functions  in  En(Rn)  using  box  spiines  instead 
of  ^-harmonic  splines  seems  to  be  more  difficult,  see  [1,  18]  and  the  pertinent 
references  cited  there. 
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Summability  and  Approximate 
Reconstruction  from  Radon  Transform 

Data 

VV.  R.  Madych* 


Abstract 

We  derive  various  reconstruction  formulas  using  complete  and  in¬ 
complete  Radon  transform  data.  In  particular,  the  limited  angle  prob¬ 
lem  is  treated.  These  formulas  are  of  convolution  type  and  are  remi¬ 
niscent  of  summability  theory  for  Fourier  series.  In  cases  where  exact 
reconstruction  is  not  possible,  formulas  for  approximate  reconstruction 
are  given  together  with  bounds  on  the  error  which  are  asymptotically 
optimal. 

1  Introduction 

Recall  that  the  Radon  transform  of  a  scalar  valued  function  /  on  Rn  is  defined 
by 

(1)  'R.f(uj)  =  f  f(tu  + y)dy 

where  u  is  an  element  of  the  unit  sphere  S"-1  in  Rn,  t  is  a  real  number,  and 
=  {y  €  Rn  :  (y,  u)  =  0}  is  the  hyperplane  through  the  origin  which  is 
perpendicular  to  u.  Thus  7 Zf(u,t)  is  the  integral  of  /  over  the  hyperplane 
{y  €  Rn  :  (y,u)  =  /}  which  is  perpendicular  to  u  and  intersects  tu.  The 
mapping  /  — ►  JZf  maps  functions  /  defined  on  Rn  to  functions  7 Zf  defined 
on  Sn_l  x  R. 

In  what  follows  we  will  often  consider  R  f(u,  t )  as  a  family  of  functions  of 
the  variable  t  parametrized  by  u.  In  such  cases  we  will  use  the  notation 

(2)  /„(<)  =  7?./(u,  0  • 

Thus  /„  is  the  function  of  the  real  variable  i  defined  by  (2). 

A  problem  associated  with  R  which  has  attracted  wide  attention  concerns 
the  recovery  of  the  phantom  /  from  full  or  partial  knowledge  of  JZf.  In 

‘Department  of  Mathematics,  University  of  Connecticut,  Storrs,  CT  0G269.  Partially 
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this  report  we  address  certain  aspects  of  this  problem.  Our  approach  is 
motivated  by  the  classical  theory  of  Fourier  series  and  the  constructive  theory 
of  functions.  The  main  ideas  may  be  described  as  follows: 

For  certain  classes  of  convolution  kernels  K ,  the  convolution  K  *  /  can 
be  conveniently  expressed  in  terms  of  R.f.  Such  K  can  be  chosen  to  be 
approximates  of  the  identity.  Doing  so  results  in  approximations  of  /  and, 
in  the  limit,  leads  to  various  reconstruction  formulas  and  algorithms.  This 
is  reminiscent  of  summability  theory  for  Fourier  series. 

In  the  case  of  incomplete  data  the  class  of  kernels  K  for  which  K  *  f 
may  be  conveniently  expressed  is  considerably  more  narrow  than  in  the  case 
of  full  data  Tv/.  Here  we  consider  data  of  the  form  {7£/(u,  i)  :  —  oo  < 
t  <  oo ,  u  €  A}  where  A  is  a  proper  subset  of  Sn_I.  For  certain  subsets 
A  the  problem  of  reconstruction  from  such  data  is  sometimes  referred  to  as 
the  limited  angle  problem.  We  show  that  it  is  possible  to  express  K  *  f  in 
terms  of  this  data  for  a  sufficiently  wide  class  of  kernels  K  to  obtain  various 
summability  theorems  and  approximate  reconstruction  formulas  which  result 
in  the  “correct”  degree  of  approximation. 

1.1  Contents 

This  paper  is  organized  as  follows: 

Section  2  contains  a  description  of  the  basic  results  in  the  case  of  full 
data.  Various  examples  are  contained  in  Section  3.  Certain  technical  results 
needed  for  the  development  of  the  partial  data  case  are  also  contained  in 
Section  3. 

Section  4  is  perhaps  the  most  interesting;  it  contains  a  description  of 
a  class  of  reconstruction  and  approximate  reconstruction  formulas  for  /  in 
terms  of  {/„  :  u  €  *4}  where  A  is  a  proper  subset  of  S”-1.  Error  bounds  are 
also  included. 

Various  comments  which  are  not  particularly  germane  to  the  development 
but  are  pertinent  to  the  subject  matter  are  given  in  Section  5. 

The  introduction  to  each  section  contains  a  rough  indication  of  its  con¬ 
tents. 

1.2  Notation 

We  now  briefly  digress  to  list  some  of  the  conventions  which  are  used  here: 
The  Fourier  transform  /  of  a  function  /  on  Rn  is  defined  by 

m  =  (2^)—^  /  c-**s(x)iz 

JRn 

whenever  it  makes  sense  and  distributionally  otherwise.  Ba-  ic  facts  con¬ 
cerning  Fourier  transforms  and  distributions  will  be  used  without  further 
elaboration  in  what  follows;  for  example,  see  [10].  The  term  support  is  also 
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used  in  the  distributional  sense;  in  particular  the  support  of  a  measurable 
function  /  is  a  well  defined  closed  set.  Lp(Rn )  is  the  usual  Lebesgue  class  of 
measurable  functions  on  Rn ,  see  [10].  If  ft  is  a  subset  of  Rn  then  Ip(ft)  is 
the  Lp  closure  of  the  subspace  of  those  functions  in  Lp(Rn )  whose  support  is 
contained  in  ft. 

The  symbol  c  is  used  to  denote  generic  constants.  Its  value  depends  on 
the  context. 

2  Full  data 

As  mentioned  in  the  introduction,  in  this  section  we  indicate  how  to  obtain 
general  formulas  for  approximations  of  /  in  terms  of  'Rf  which  are  analogous 
to  the  classical  summability  formulas  associated  with,  for  example,  the  Fejer 
or  Abel  kernels  in  the  theory  of  Fourier  series.  These  formulas  are  based  on 
certain  elementary  observations. 

2.1  Summability  Formulas 

A  function  H  on  Rn  is  said  to  be  a  ridge  function  if  II  (x)  =  h((x,u)) 
for  some  unit  vector  u  and  some  univariate  function  h.  If  H  is  such  a  ridge 
function  then  convolving  it  with  another  function  /  on  Rn  and  expressing 
the  integral  in  the  {iz,  ux}  coordinate  system  results  in 

* 

JRn  II (x  -  y)f{y)dy  =  J_  ^  h((x,  u)  -  t)f{tu  +  z)dz j  dt 

/OC 

h((x,  u)  -  t)fu(t)dt 

•OO 

or,  in  more  compact  and  suggestive  notation, 

(3)  H  *  f(x)  =  h*  fu((x,u))  . 

In  other  words,  the  multivariate  convolution  of  H  and  /  evaluated  at  i  is 
equal  to  the  univariate  convolution  of  h  and  /„  evaluated  at  (x,  u).  We  are 
assuming  of  course  that  all  the  functions  involved  are  sufficiently  well  behaved 
so  that  the  integrals  make  sense. 

Formula  (3)  is  elementary  and,  as  we  have  seen,  easily  verifiable.  Never¬ 
theless  it  is  very  useful;  indeed,  it  is  the  basis  of  the  summability  formulas 
given  below.  For  instance  it  should  be  quite  easy  to  see  that  if  a  convolution 
kernel  K  is  a  sum  of  ridge  functions  then  K  *  f  can  be  readily  computed 
from  knowledge  ot  'JZf.  The  definitions  and  formulas  below  are  simply  more 
precise  versions  of  this  observation. 

A  locally  integrable  function  K  on  Rn  is  a  uniform  sum  of  ridge  functions 
if  there  is  an  even  locally  integrable  univariate  function  h  such  that 

(4)  K(x)  =  —  /  h{(x,  u))  d<r(u) 

<7n  ys"-‘ 
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for  almost  all  x  in  Rn.  Here  do(u)  denotes  the  usual  rotation  invariant 
Lebesgue  measure  on  the  unit  sphere  S"-1  in  Rn  and  on  denotes  its  total 
measure. 

Observe  that,  as  a  consequence  of  the  definition,  if  K  is  a  uniform  sum 
of  ridge  functions  then  it  must  be  a  radial  function.  In  other  words,  K(x)  — 
A: ( [a: | )  where  k  is  a  univariate  function. 

The  proposition  below  gives  a  formula  for  K  *  f  in  terms  of  7 Zf.  It  is  a 
transparent  consequence  of  (3),  (-1),  and  an  appropriate  change  in  the  order 
of  integration  which  can  be  justified  via  Fubini’s  Theorem. 

Proposition  1  If  a  locally  bounded  Junction  K  is  a  uniform  sum  of  ridge 
functions  and  f  is  an  integrable  function  with  compact  support  then 

(5)  K*f(x)  =  —[  h*  fu{(x,u))do(u) 

on  Js”-' 

where  h  and  K  are  related  by  (4). 

The  hypothesis  on  /  and  K  in  the  above  proposition  can  be  relaxed  in 
various  ways.  For  example,  if  both  K  and  h  are  integrable  over  Rn  and  R  re¬ 
spectively  then  /  need  not  have  compact  support.  We  will  not  go  into  further 
details  here  since  the  proposition  is  sufficiently  general  for  our  applications. 

Formula  (5)  is  very  suggestive;  it  should  allow  us  to  obtain  summability 
results  analogous  to  the  classical  summability  results  alluded  to  in  the  be¬ 
ginning  of  this  section.  In  view  of  this,  the  question  concerning  what  radial 
functions  are  uniform  sums  of  ridge  functions  is  of  some  interest. 

2.2  Uniform  Sums  of  Ridge  Functions 

To  obtain  an  answer  to  the  question  concerning  what  radial  functions  are 
uniform  sums  of  ridge  functions  we  begin  by  writing 

(6)  K{x)  =  A(|z|) 

to  express  the  relationship  between  the  radial  function  K  and  the  correspond¬ 
ing  univariate  function  k.  Using  this  notation  and  the  Funk-Hecke  theorem, 
we  may  rewrite  formula  (4)  as 

(7)  Jfc(r)  =  —  I'  h{rt){  1  -  <2)(n-3)/2d< 

On  J- 1 

where  r  =  \x\.  This  can  also  be  derived  more  directly  by  using  polar  coordi¬ 
nates  to  express  the  integral  on  the  right  hand  side  of  (4)  as 

o-n_i  f  h(r  cos  9) (s\n9)n~2d9 
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followed  by  the  change  of  variables  i  =  cos  9.  Another  useful  variant  of  (  5) 

(8)  /’ 

anrn  1  Jo 

follows  from  using  the  change  of  variables  s  =  rt  and  the  fact  that  h  is  an 
even  function. 

It  is  clear  from  formula  (8)  that  for  given  k  the  function  h  is  unique.  A 
representation  of  h  in  terms  of  k  follows  from  (8)  and  the  identities  on  pages 
14  and  15  in  [15].  An  alternate  approach  results  from  the  observation  that 
the  change  of  variables  <r  =  s1  and  p  =  r2  reduces  (8)  to  a  standard  half  line 
convolution  equation  which  can  be  solved  by  routine  methods,  for  example, 
Laplace  transforms.  In  any  case  a  representation  of  h  in  terms  of  k  is  given 
by 

(9)  h{t)  =  cj  Jo  (<2  -  s2)(”  3)/V-1*(s)ds 

which  may  also  be  expressed  as 

(10)  MO  =  Cj  t^f  (l  -  s'f‘-3,l2s'-'k(,s)is 

where 

_  _ 2^/ff _ 

r((n  -  l)/2)r(rc/2) 

These  observations  can  be  summarized  as  follows: 

Proposition  2  (i)  If  K  is  a  uniform  sum  of  ridge  functions  then  K  is  radial 
and  the  h  in  representation  (f)  is  unique,  (ii)  Conversely,  if  K  is  a  radial 
function,  K{x)  =  and  k  is  n  —  l  times  continuously  differentiable  then 

K  is  a  uniform  sum  of  ridge  functions.  Furthermore,  the  h  in  representation 
(4)  is  unique  and  can  be  obtained  from  formula  (9)  or  (10). 

The  conditions  on  k  in  the  second  half  of  the  statement  of  the  proposition 
can  be  somewhat  relaxed.  In  the  cases  n  =  2  and  n  =  3  the  details  are  given 
below. 

2.2.1  The  Cases  n  =  2  and  n  =  3 

In  the  case  n  =  2  equation  (9)  may  be  written  more  explicitly  as 
h(i)  =  —  /  ( t2  —  s2)  ^  sk(s)ds  . 

Integration  by  parts  followed  by  the  indicated  differentiation  results  in 

(11)  h(t)  =  k( 0)  +  t  J*  (t2  -  s2)~1/2  dk(s) 
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where  the  integral  is  a  standard  Stieltjes  integral. 

The  above  calculations  are  valid  whenever  k  is  a  function  of  bounded 
variation  on  every  finite  interval  [0,T],  0  <  T  <  oc.  Furthermore,  if  k 
satisfies  this  hypothesis,  it  follows  from  Fubini’s  theorem  that 

J  j  t[i7  —  s2)  ^  dk(s)dt  =  J  ( T 2  —  s2)  ^  dk(s )  . 

By  inserting  absolute  values  and  replacing  =  with  <  in  the  appropriate  places 
in  the  last  formula  one  can  readily  see  that  h  is  a  locally  integrable  function 
and  /0T  \h(t)\dt  is  bounded  by  T  times  the  sum  of  [ Ar(0) |  and  the  total  variation 
of  k  over  the  interval  [0,  T]. 

These  observations  can  be  summarized  by  the  following  addendum  to 
part  (ii)  of  Proposition  2. 

Proposition  3  Suppose  K  is  a  radial  Junction  on  R2 ,  K(x)  =  fc(|z|),  and  k 
is  a  function  of  bounded  variation  on  the  intervals  [0,  T],  0  <  T  <  oo.  Then 
K  is  a  uniform  sum  of  ridge  functions  and  the  h  in  representation  (f)  can 
be  obtained  from  formula  (11). 

In  the  case  n  =  3  equation  (9)  reduces  to 

(12)  h{t)  =  k(t)  +  tk'(t) 

where  k1  denotes  the  derivative  of  k.  From  this  we  may  easily  conclude  the 
following: 

Proposition  4  Suppose  K  is  a  radial  function  on  R 3  such  that  K(x)  = 
£(|:c|)  and  k  is  absolutely  continuous.  Then  K  is  a  uniform  sum  of  ridge 
functions  and  the  h  in  representation  (f)  can  be  obtained  from  formula  (12). 

2.3  Summability  and  reconstruction 

The  above  results  allow  for  a  very  large  class  of  summability  and  reconstruc¬ 
tion  formulas. 

To  see  this,  suppose  K  is  an  integrable  function  on  Rn,  that  is  it  satisfies 


(13) 

/  \K(x)\dx  <  oo 
JR" 

Normalize  K  so  that 

(14) 

and  for  positive  e  write 

/  K(x)dr  =  1 

J  Rn 

(15) 

Kt{x)  =  rnK{x/e) 
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Recall  that  if  /  is  in  Lp(Rn)  for  some  p  which  satisfies  1  <  p  <  oc,  then 
Kt  *  f  converges  to  /  in  Lp(Rn )  as  e  goes  to  zero.  Furthermore  if  K  also 
satisfies 

.  .  |A'(z)j  <  *1* ( ( s' | )  where  <I>  is  a  nonincreasing  function 

'  on  the  interval  0  <  t  <  oo  and  /0°°  4>(r)rn_1dr  <  oo 


then  Kt*  f(x)  converges  to  f(x)  almost  everywhere.  Detailed  proofs  of  these 
facts  can  be  found  in  [52].  Estimates  on  the  rates  of  convergence  of  Kt  *  / 
to  /  can  be  had  in  term  of  various  smoothness  conditions  on  /  and  certain 
moment  conditions  on  K.  If  such  a  /f  is  a  uniform  sum  of  ridge  functions 
then  by  virtue  of  Proposition  1  we  can  easily  obtain  the  type  of  results  alluded 
to  above. 

To  state  some  of  this  more  precisely,  suppose  K  is  a  uniform  sum  of  ridge 
functions.  Then 

(17)  Kt{x)  =  —  f  h€((x,u))da(  u) 

ern 

where  ht(t)  ~  e~nh(t/e)  and  h  is  the  function  in  representation  (4)  of  K . 


Proposition  5  Suppose  K  is  an  integrable  kernel  which  is  a  uniform  sum  of 
ridge  functions  and  which  satisfies  (13),  (14),  and  (16).  Define  the  functions 
Kt  and  ht  via  (15)  and  (17)  and  assume  that  ht  is  integrable.  Suppose  f  is 
an  integrable  function.  Then 

(18)  fim  —  h<  *  /.((*,  u))da(u )  =  f{x) 


for  almost  all  x  in  Rn .  If,  in  addition,  f  is  in  Lp(Rn )  for  some  p  satisfying 
1  <  p  <  oo  then 


lim  f  |/(z)  ~  ~  [ 

<->oo  JRn  crn  jSn-\ 


K  *  /«((z>  u))da {u)\p dx  =  0  . 


Furthermore,  if  f  is  uniformly  continuous,  (18)  holds  uniformly  in  x  for 
x  €  Rn. 


As  a  specific  example  of  Kc  satisfying  the  conditions  of  the  above  propo¬ 
sition  consider  the  Poisson  kernel 

is  r  \  _  _ _ 

el  ]  (e2  +  ]i|2)("+i)/2 

where  c„  is  constant.  That  this  kernel  is  a  uniform  sum  of  ridge  functions 
follows  from  Proposition  2.  The  formula  for  the  corresponding  ridge  function 

Ttq)  ,  ZUf/vg  +  g) 

where  Tn  is  the  classical  Tchebichef  polynomial  of  the  first  kind  of  degree  n, 
see  Subsection  3.2.1. 
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Corollary  1  Suppose  ht  is  defined  by  (19)  and  suppose  f  is  an  intcgrable 
function.  Then 

(20)  f(x)  =  lim  —  f  ht*  /„((*,  u))da(u) 

‘-*00  crn  Js 

for  almost  all  x  in  Rn.  If,  in  addition,  f  is  in  Lp(Rn)  for  some  p  satisfying 
1  <  p  <  oo  then 

lim  /  |  f(x) - —  /  ht*  fu((x,  u))da(u)\vdx  =  0  . 

t-oo  JRn  £Tn  J sn~' 

Furthermore,  if  f  is  bounded  and  uniformly  continuous,  (20)  holds  uniformly 
in  x. 


These  results  can  be  used  to  obtain  elementary  derivation  of  various  vari¬ 
ants  of  the  usual  type  of  “inversion  formulas”.  For  example,  in  the  case  n  =  2 
formula  (19)  gives 


(21) 


MO 


1  e2  —  t2 
2n  (f2  +  t2)2  ' 


Since  hc(t)dt  =  0  we  may  write 


(22)  -fZ  jsx  K  *  fu{{x ,  u))da{u)  = 

=  2?r  Is'  /«,  ~  0  “  /«((*»  u))}dld(r(u) 

or,  using  the  fact  that  is  even, 

(23)  ~  J^h^  fv{(x,u))d<x{u)  = 


-  —  f  f  K(i){fu{(x,u)+t)-2fu((x,u))+fu((x,u)-i)}dtda(u). 
47T  J  S'  J- oo 

Now,  assuming  that  /„  is  sufficiently  smooth,  the  right  hand  side  of  (22) 
may  be  expressed  as 


7—  /  /  /  thsl(t)fl((x,  u)  —  t)dtdsda(u) 
i'K  JS 1  Jo  J- oo 

where  f(  denotes  the  derivative  of  /„.  Since  limt_o^»f(0  =  —  1  /(27rt),  it 
can  be  anticipated  that  if  e  goes  to  0  then  the  innermost  integral  in  the  last 
expression  will  converge  to  the  Hilbert  transform  of  /';  for  examples  of  such 
arguments  see  [53].  This  will  result  in  an  integrand  which  is  independent  of 
the  s  and  so  this  variable  can  be  integrated  out. 

Letting  e  go  to  zero  in  (23)  also  results  in  an  expression  which  is  inde¬ 
pendent  of  e.  We  summarize  these  elementary  calculations  as  follows: 
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Corollary  2  If  f  is  a  function  on  R2  which  is  truce  continuously  differen¬ 
tiable  and  has  compact  support  then 


(24) 


/(*) 


/'((z,u)  -  t) 
i 


dtd<j(u) 


and 


f(x)  = 


-1  r  f°°  fu({x,  V.)  +  0  -  2/„((i,«))  +  /«((*,  u)  -  t)  Jt  t/  s 

i?  L  L - p - itd,rM 


for  all  x.  The  integral  in  the  t  variable  in  (24)  is  to  be  interpreted  in  the 
principal  value  sense  and  /'  denotes  the  derivative  of  fu. 

Formula  (24)  is  one  of  the  standard  variants  of  the  Radon  inversion  formula 
and  may  be  found,  for  instance,  in  [41].  We  have  not  attempted  to  give 
the  most  general  hypothesis  on  /  in  the  last  corollary;  by  going  through 
the  above  calculations  the  reader  should  have  no  difficulty  in  relaxing  the 
conditions  on  /  somewhat.  It  may  be  of  some  interest  to  obtain  the  weakest 
possible  conditions  on  /  for  which  the  formulas  in  the  above  corollary  are 
valid  almost  everywhere. 


3  Examples  and  miscellaneous  results 

The  first  three  subsections  below  contain  examples  of  ridge  function  repre¬ 
sentations  and  alternate  ways  of  computing  such  representations.  The  final 
three  subsections  contain  material  which  will  be  used  in  our  treatment  of 
partial  data. 

3.1  Ridge  function  representations  for  certain  bivariate  kernels 

Because  it  models  certain  non-destructive  imaging  schemes,  the  bivariate 
case  is  particularly  important  in  practical  applications.  For  example  see 
[8,  16,  41].  In  this  subsection  we  give  several  examples  in  the  case  n  =  2  of 
radial  approximations  of  the  identity  and  their  ridge  function  representations. 

3.1.1  Averaging  kernels 

If  K  is  1  fir  times  the  indicator  function  of  the  unit  disk  centered  at  the 
origin  then  Kt  *  f(x)  is  the  average  of  /  over  the  disk  centered  at  x.  The 
corresponding  ridge  function  ht  can  be  easily  derived  from  (11).  Thus  if 

*,(*)  =  (  ll{f)  ifil|S£ 

v  ;  (  0  otherwise 

then  the  corresponding  ridge  function  in  representation  (4)  is  given  by 

h  (t\  =  [  l/M  _  if|<l<£ 

^  1  {l/(7Tf2)}{l  —  \i\/\/t2  —  £2}  otherwise. 
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3.1.2  Multiples  of  (1  +  |x|2)°. 

If  K{x)  =  (1  +  |x|2)a  then  its  corresponding  ridge  function  h(t)  can  obtained 
from  (11)  quite  conveniently.  Indeed, 

k(t)  =  1  +  2a t  f\l  +  s2)Q_1(t2  -  s2)"1/2sds 
Jo 

which,  after  appropriate  change  of  variables,  can  be  expressed  as 

(25)  h(i)  =  1  +  2at(\  +  f)a~l12  -  s2)Q_1ds  . 

Jo 

In  the  cases  when  a  is  an  integer  multiple  of  1/2  the  integral  in  (25)  can 
be  easily  evaluated  and  expressed  in  terms  of  elementary  functions.  For 
example,  in  the  case  a  =  —3/2  formula  (25)  reduces  to 

1  -t2 
(1  +  i2)2  ' 

Thus  if  Kt  is  the  so-called  Poisson  kernel 

K‘(l)  =  S(e  +  |.rr/= 

then  the  corresponding  ridge  function  in  representation  (-1)  is  given  by 

1  e2  —  t2 

=  sunw  ■ 


3.2  Fourier  transforms 

The  notion  of  uniform  sum  of  ridge  functions  expressed  by  (4)  can  be  ex¬ 
tended  to  tempered  distributions  as  follows:  Recall  that  the  usual  way  of 
defining  linear  t-ansformations  on  distributions  is  via  duality  and  observe 
that  the  formal  dual  of  the  mapping 

<f>(t)  — ♦  /  <i>((x,  u))da(u) 

Jsn~' 

which  maps  sufficiently  well  behaved  univariate  functions  into  n- variate  func¬ 
tions  is  the  mapping 

<f>(x)  — »  /  7 l<f>(u,t)d(r(u)  . 

Jsn~ 1 

Hence  it  is  quite  natural  to  define  a  distribution  K  in  S'{Rn)  to  be  a  uniform 
sum  of  ridge  distributions  if  there  is  an  even  distribution  h  in  S'(R)  such 
that 


for  all  &  in  S(Rn).  Here  (a.  3)  denotes  the  action  of  the  distribution  a  on 
the  test  function  (3.  We  also  remind  the  reader  that  <f>u{l)  =  'R.<p(u,t). 

Note  that  for  fixed  u  in  S"_1  the  mapping  <j>  — >  <pu  maps  S(Rn)  contin¬ 
uously  into  S(R).  Thus  the  tempered  distribution  K  is  well  defined  by  (26) 
for  every  even  distribution  h  in  S'(R). 

Having  defined  (4)  distributionally,  its  Fourier  transform  is  also  well  de¬ 
fined  as  follows:  If  K  satisfies  (26)  then  its  Fourier  transform  K  satisfies 

(27)  <A',  <?)  =  —  /  (h,  <j>u)d(r{u) 

for  all  <f>  in  S(Rn).  Here  (£“(<)  =  <f>(tu).  Of  course  K  and  h  are  the  Fourier 
transforms  of  K  and  h  as  distributions  on  Rn  and  R  respectively.  In  the  case 
when  h  is  sufficiently  smooth,  (27)  may  be  expressed  as 


(28) 


m)  = 


mn 

ff.ifi*-1 


3.2.1  The  Poisson  kernel 

Formula  (28)  is  useful  in  determining  the  ridge  function  representation  of 
certain  radial  K's.  For  example  consider  the  Poisson  kernel  which  is  defined 
for  positive  e  by 

C  € 

(29)  hi{X)  =  (e2  +  jx|2)(n  +  l)/2 

and  whose  Fourier  transform  is  given  by 

(30)  Kt{£)  =  (27r)-n/2  e"'1*1  . 

Here  cn  is  a  constant  whose  exact  expression  is 

cn  =  7r<"+,>/2r((n+l)/2). 

The  formula  for  the  Fourier  transform  of  the  corresponding  ht  is  easily  ob¬ 
tained  from  (28)  and(30);  it  is 

(31)  ht(r)  =  (2-)-"/2(7„|rr1e-'lTl. 

To  obtain  a  formula  for  hl  observe  that  (31)  may  be  expressed  as 

K(r)  =  (2!r)-n^2<7„  e-W 

and  recall  that  exp(  — e|r|)  is  the  Fourier  transform  of  a  constant  multiple  of 
e/(c2  4-  t7).  Thus  hc  is  a  constant  multiple  of 

(  d\n-1  €  Tn(c/v/^+l^) 

^  dc)  (f2  +  <2)  C  (e2  +  f2)"/2 


11 


where  the  equality  follows  from  an  induction  argument,  c  is  a  constant,  and 
Tn  is  the  classical  Tchebichef  polynomial  of  the  first  kind  of  degree  n,  namely, 
Tn(cos9)  =  cosnO.  This  constant  can  be  determined  by  evaluating  both  Kt 
and  ht  at  the  origin  which  results  in 


(0  — 


Tn(e/V? TP) 

(e2  +  t2)"/2 


where  cn  is  the  same  constant  as  in  (29). 


3.3  Polynomials  and  ridge  functions 

Since  it  is  quite  transparent  that 


/  (x,  u)7mdcr(u)  =  a(n,  m)b 

Js  »-• 


where  a(n,  m)  is  a  positive  constant,  ridge  function  representations  of  radial 
polynomials  is  rather  appealing.  Thus  if  P  is  any  radial  polynomial 

M 

P(x)  =  £  bM73 


P(x)  =  /  h((x,  u))dcr(u) 

Jsn~l 


where 


MO  =  E 


w 

The  above  formulas  lead  to  several  interesting  relations  between  various 
classes  of  classical  orthogonal  polynomials.  This  can  be  realized  by  observing 
that 

a(n,  m)  =  J'  t2m{  1  - 

and,  as  in  [30],  using  the  representation  of  these  polynomials  in  terms  of  the 
hypergeometric  function.  However  a  more  intuitive  approach  can  be  outlined 
as  follows: 

Let  L7(Bn,  A)  be  the  Hilbert  space  consisting  of  those  measurable  func¬ 
tions  /  on  the  unit  ball  Bn  for  which 


Jb  j/«i2o  -  iip)1* 


is  finite.  This  Hilbert  space  is  equipped  with  the  inner  product 


{f,g)x  =  I  /(*M*)(1  ~  \x\7)xdx 

J  Bn 
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If  Vm  denotes  the  set  of  n- variate  polynomials  of  degree  <  m  then  we  define 
L2{Bn ,  A,  0)  =  V0  and  for  m  >  1 

L2(Bn,X,m)  =  {PeVm  :  {P,  Q)\  -  0  for  all  Q  <E 'Pm_i}  . 

Finally,  recall  the  classical  Jacoby  polynomials  P^'0\t),  m  =  0, 1, 2, . . and 
the  classical  Gegenbauer  or  ultraspherical  polynomials  C^(t),  m  —  0, 1, 2, . . 
these  are  families  of  univariate  polynomials  which  are  orthogonal  on  the 
interval  —  1  <  i  <  1  with  respect  to  the  weights  (l+<)a(l—  t)'3  and  (1— t2)A_1^2 
respectively;  m  denotes  the  degree  of  the  polynomials,  a,  ft,  and  A  are  real 
parameters.  The  notation  for  these  polynomials  is  standard,  see  [11]. 

Proposition  6  (i)  A  radial  polynomial  is  in  L2(Bn ,  A,  2m)  if  and  only  if  it 
is  a  constant  multiple  of  /^A,(n_2I/2l(2|x|2  —  1).  (ii)  .4  ridge  function  is  in 
L2(Bn,  A ,m)  if  and  only  if  it  is  a  constant  multiple  ofCf^nl2((x,  u ))  for  some 
unit  vector  u. 

Proof  If  P  is  a  radial  polynomial  in  V2 m  then  P(x)  =  p(|x|2)  where  p  is 
a  univariate  polynomial  of  degree  m  and  if  Q  is  in  V^m- 1  then 

(P,Q)  »=  /‘p(r,)«(r!)(l-r*)V-I<fr 

Jo 

where  _ 

q(r2)  =  /  Q(ru)do(u) 

Jsn -• 

and  q  is  a  univariate  polynomials  of  degree  <  m  —  1.  Hence 

and,  since  q  may  range  over  all  polynomials  of  degree  <  m  —  1,  (i)  follows  by 
virtue  of  the  fact  that  constant  multiples  of  PJ^’^n_2^/2)(2t  —  1)  are  the  only 
polynomials  of  degree  m  which  are  orthogonal  in  this  sense  to  all  polynomials 
q  of  degree  <  m  —  1. 

If  P  is  a  ridge  polynomial  in  Vm  then  P[x)  =  p{(x,  «))  for  some  univariate 
polynomials  of  degree  m  and  if  Q  is  in  Vm-i  then 

(P,Q) a  =  f_ i  p(i)g(i)dt 

where 

9(1)  =  /  fQ(tu  +  y)(]-P-\y\2)xdy 
Jm  0 

and  -4(<)  =  {y  €  ux  :  |yj2  <  1  —  t2}.  The  the  change  of  variable  y  = 
(1  —  l2)ll2z  allows  us  to  express  g  as 

g{t)  =  (1  -  t2)x+^12  [  Q{tu  +  {l-t2yi2z){\  -  \z\7)xdz  . 

Ja{  1) 
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Since  Q(tu  +  (1  —  i2)1!2  z)  =  Y1  P(1  —  t2)\vd7zv  where  the  sum  is  taken  over 
all  integers  j  and  multi-indices  v  =  (vy, . . . ,  such  that  |i/j  +  j  =  + 

- h  un-\  +  j  <  m  —  1,  the  terms  with  odd  |//|  integrate  out  to  0  and  as  a 

result  g(t)  =  q(t)(l  —  t2)x+(-n~1d2  where  q  is  a  polynomial  of  degree  <  m  —  1. 
Hence 

(P,Q)>  =  j\(t)q(t)(\-i)X+nl2~ll2dt 

and,  since  q  may  range  over  all  polynomials  of  degree  <  m  —  1,  (it)  follows 
by  virtue  of  the  fact  that  constant  multiples  of  C!£+Tl/2\t)  are  the  only  poly¬ 
nomials  of  degree  m  which  are  orthogonal  in  this  sense  to  all  polynomials  q 
of  degree  <  m  —  1.  ■ 


Now,  by  virtue  of  (n)  of  the  above  proposition  fs„- 1  ((x,  u))da(u)  is 

in  L2(Bn ,  A,  2m)  and,  since  it  is  clearly  radial,  it  must  be  a  constant  multiple 
of  P^'<-n~2d2'>(2\x\2  —  1)  view  of  (i)  of  this  proposition.  The  constant  can  be 
easily  evaluated  by  setting  x  =  0  in  both  expressions.  As  a  result  we  may 
conclude  the  following: 

Corollary  3  For  A  >  —  1 

(3J)  /^•("-j)/«(2|xI!  -  1)  =  jsM  c£"l\{x,  u))i*( u) 

3.4  Moduli  of  smoothness,  moment  conditions,  and  degree  of  ap¬ 
proximation 

If  /  is  in  Lp(Rn )  for  some  value  of  p,  1  <  p  <  oo,  its  Lp  modulus  of  continuity 
up{f'i  ^),  6  >  0,  is  defined  by 

wP(/;  6)  =  sup  \f(x  -y)~  f(*)\pdx) 

Observe  that  this  modulus  is  well  defined  and 
(35)  limwp(/;<$)  =  0 

for  all  /  in  Lp(Rn). 

In  the  case  p  =  oo  this  modulus  reduces  to  the  usual  modulus  of  conti¬ 
nuity,  namely, 

^oo (/;£)=  sup  \f{x)-f{y)\. 

\x-y\<6 

Furthermore 

|irnw00(/;5)  =  0 

if  and  only  if  /  is  uniformly  continuous. 


1-f 


We  briefly  summarize  some  of  the  elementary  properties  of  these  moduli. 
In  what  follows  we  take  the  parameter  p  and  function  /  to  be  fixed  and 
consider  u  as  a  function  of  the  parameter  8,  8  >  0;  thus  for  simplicity  we 
write  JL/p{f]S)  =  u(8). 

•  The  function  u(8)  increases  monotonically. 

•  For  non-negative  parameters  8X  and  f2>  ^’(^i  +  62)  <  ^’(^1)  +  ^'(£2)- 

•  For  any  positive  integer  n.  u(n8)  <  nuj(8). 

This  is  an  immediate  consequence  of  the  previous  item. 

•  If  A  >  0,  then  lu(\8)  <  (A  +  l)u;(6). 

To  see  this  choose  an  integer  n  such  that  n  <  A  <  n  -f  1  and  write 

u(A6)  <  u,'((n  T  1)^)  ^  (n  T  <  (A  -f-  1)cl)(6). 

•  If  8l  <  82  then 

u(h)  ^  M61) 

62  ~  «i  ’ 

To  see  this  simply  write 

w(«2>  =  W  1 j  <  +  ll U>(«,)  < 

These  moduli  provide  a  natural  and  useful  measure  of  the  degree  of  ap¬ 
proximation  of  various  processes.  For  example,  suppose  Kc(x),  c  >  0,  is  a 
family  of  integrablc  convolution  kernels  which  satisfy 


(36) 

>; 

TT 

ft. 

H 

II 

k— » 

(37) 

/  \h\(x)\dx  <  ci 

Jr ™ 

and 

(38) 

/  \x\\K((x)\dx  <  c2( 

J  Rn 

where  cq  and  c2  are  constants  independent  of  t.  Then  it  is  not  difficult  to 
show  the  following: 

Proposition  7  If  Kt(x),  e  >  0,  is  a  family  of  integrablc  convolution  kernels 
which  satisfy  (36),  (37),  and  (38)  then  for  any  f  in  Lp(Iln),  1  <  p  <  00, 

\\Kt  *  f  -  fWiPiR")  <  cup{f;  () 
where  c  is  a  constant  independent  of  f  and  e. 
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Proof  By  virtue  of  (36)  we  may  write 

Kt  *  f(x)  -  f(x)  =  f  Kt(y){f(x  -  y)  -  f(x)}dy  . 

JRn 

Direct  estimation  in  the  cases  p  =  1  and  oo  or,  in  the  cases  1  <  p  <  oc,  an 
application  of  the  integral  variant  of  Minkowski’s  inequality  results  in 

\\Ke  *  f-  /lUn/j")  <  /  uP(f;  \y\)\K\{y)\dy  . 

J  ftn 

Finally,  the  estimate 

wp(/;  \y\)  =  up  ^/;  ycj  <  ^  +  1^  w„(/;c) 

together  with  (37)  and  (38)  imply  the  desired  result.  ■ 

Using  the  notation  Ay/(x)  to  denote  the  difference  f(x  —  y)  —  f(x)  and, 
if  k  is  an  integer  >  2,  A yf{x)  —  Ay  ^A*-1/j  (x),  we  define  the  Ar-th  order  Lp 
modulus  of  smoothness  w*(/;  6),  6  >  0,  via 

Up(f>&)  =  SUP  • 

|y|<4 

Note  that  ujl  =  ojp.  In  the  case  k  >  2  the  properties  of  u>p  are  also  well 
known;  they  are  completely  analogous  to  the  case  k  —  1  with  the  obvious 
modifications,  see  [56].  Here  we  only  point  out  the  transparent  facts  that 
\8)  <  2 m~kuk(f\t>)  if  k  <m  and  that  w£(/;£)  can  go  to  zero  as  fast  as 
0(6k )  without  being  identically  zero. 

As  in  the  case  k  =  1,  these  moduli  provide  a  natural  and  useful  measure 
of  the  degree  of  approximation  for  various  processes.  In  the  example  where 
A'((x),  e  >  0,  is  a  family  of  integrable  convolution  kernels,  condition  (38) 
ought  to  be  replaced  with 

(39)  f  |x|*|/fe(a:)|</x  <  c2£fc 

JR" 

where  c?  is  a  constant  independent  of  e,  in  order  to  take  advantage  of  the  po¬ 
tentially  faster  decay  as  e  — >  0.  Then  it  is  not  difficult  to  show  the  following: 

Proposition  8  Suppose  K^x),  e  >  0,  is  a  family  of  integrable  convolution 
kernels  whirh  satisfy  (36),  (37)  and  (39).  Define 

a',*m = Id-  I)*--*-1 7 .  *! ,  (t  -  j)-K,(z/(k  -  A) . 

Then  for  any  f  in  Lv(Rn),  1  <  p  <  oo, 

II A**/-/lliw  <«*{(/; 0 

where  c  is  a  constant  independent  of  f  and  e. 


Proof  By  virtue  of  (36)  and  the  nature  of  Kk  we  may  write 

N't  «  f{x)  -  fix)  =  /  /f«(y)A;/(x)rfy  . 

The  rest  of  the  proof  follows  along  the  same  lines  as  that  of  the  previous 
proposition.  ■ 


Before  leaving  this  section,  we  remark  that  if  A'(x)  is  an  integrable  func¬ 
tion  which  satisfies  (13),  (14),  and 

(40)  f  \x\k\Kix)\dx  <  oo 

jRn 

then  the  family  of  convolution  kernels  A\(x)  =  e_nA'(z/f)  satisfies  (36),  (37), 
and  (39). 

3.5  Polynomial  convolution  kernels 

In  many  problems  of  practical  and  theoretical  interest  the  phantom  /  is  taken 
to  have  support  in  a  fixed  disk  or  ball  of  finite  radius,  say  1/2,  about  the 
origin.  In  this  case  consideration  of  K  *  fix)  for  x  in  f?”/2  =  {x  €  Rn  : 
jij  <  1/2}  does  not  involve  the  behavior  of  K  outside  the  ball  of  radius  one 
about  the  origin.  Similarly  for  such  /  and  x  the  calculation  of  the  quantities 
on  the  right  hand  side  of  formulas  (5)  does  not  involve  the  behavior  of  h 
outside  the  interval  —  1  <  t  <  1.  In  this  case  the  criteria  that  K  be  a  good 
approximation  of  the  identity  can  be  modified  to 


(41) 

/  Kix)dx  =  1 

J  Bn 

and 

(42) 

/  \Kix)\dx  <  e 

y«<ixi<i 

for  sufficiently  small  8  and  e.  A  wide  class  of  such  kernels  are  given  by  certain 
radial  polynomials.  In  view  of  the  fact  that  ridge  function  representation  of 
such  polynomials  is  very  appealing,  the  corresponding  kernels  should  have 
significant  applications  in  tomography. 

There  are  many  ways  of  producing  radial  polynomial  kernels  which  behave 
like  good  approximations  of  the  identity.  A  possible  method  of  producing 
good  kernels  suitable  lor  numerical  work  can  be  based  on  the  ChristofTel- 
Darboux  formula  for  orthogonal  polynomials. 

For  our  purposes,  summability  and  degree  of  approximation,  we  need 
polynomial  kernels  which,  in  addition  to  (41),  enjoy  appropriate  analogues 
of  (16),  (37),  and/or  (39).  These  properties  are  easily  verified  for  the  families 
given  below. 
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3.5.1  Kernels  for  summability 


A  family  of  radial  polynomials  which  can  be  easily  shown  to  possess  an 
analogue  of  (16)  is  the  following: 

Consider  the  sequence  of  polynomials  Pm,  m  —  0, 1, 2, . . .,  defined  by 


(«) 


Pm(x)  =  c(n,  m)(l  -  |x|2)m 


where 


(44) 

c(n,  m)  Ib 

x\ 2)mdx  = 

_  fn 
2 

•  i  -  t)mtn'2-idi 

Jo 

is  a  constant  chosen  so  that 

(45) 

f  Pm{x)dx  =  1 

Jb n 

for  all  m. 

Observe  that 

crn  T(m  +  l)T(n/2) 
2  T(m  +  nj 2  +  1) 


(1  —  |x|2)m  <  e~m I*!3  whenever  |x|  <  1 


and 


c(n,  m)  <  c  mn^2 


where  c  is  a  constant  independent  of  m;  the  last  inequality  follows  easily 
from  the  definition  of  the  gamrna  function  T  in  the  case  of  even  n  and  by 
virtue  of  Stirling’s  formula  in  the  case  of  odd  n.  These  inequalities  allow  us 
to  conclude  that 


(46)  Prn(x)  <  cmnl2e  whenever  |x|  <  1 


where  c  is  a  constant  independent  of  m. 

Observe  that  (46)  is  a  special  example  of  (16)  in  the  case  e  =  m"1^2  with 
the  restriction  that  |x|  <  1.  Hence  we  may  conclude  the  following: 

Proposition  9  Suppose  f  is  an  integrable  function  with  support  in  B ”^2, 
in  other  words  f  €  Zd(5"/2),  and  Pm,  m  =  0,1,2,...,  is  the  sequence  of 
polynomials  defined  by  (f3).  Then 


(47)  lim  Pm  *  f(x)  =  f(x) 

m— *oo  v  2 

for  almost  all  x  in  Bnlj2.  If,  in  addition,  f  is  in  Z.p(B"/,2)  for  some  p  satisfying 
1  <  p  <  oo  then 

lim  /  \f(x)  -  Pm  *  f(x)\pdx  =  0  . 

m— *  oo  /on 
U\/2 

Furthermore,  if  f  is  continuous,  (47)  holds  uniformly  in  x  for  x  €  B'f^. 
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3.5.2  Degree  of  approximation 

Unfortunately,  the  simple  family  of  polynomial  kernels  defined  by  (43)  is  not 
sufficient  to  obtain  the  “correct”  degree  of  approximation  results.  What  is 
needed  is  a  family  which  in  addition  to  satisfying  (45)  also  satisfies  moment 
conditions  of  the  form 

/ 

J  Bn  mK 

where  c  is  a  constant  independent  of  m.  I  know  of  no  elementary  formulas 
for  such  polynomials.  Theoretically  the  most  convenient  and  least  technical 
construction  seems  to  be  the  following: 

The  idea  is,  for  each  positive  integer  k ,  to  construct  a  sequence  of  positive 
radial  polynomials,  Pm,  m—  1,2,...,  where  Pm  is  of  degree  no  greater  than 
2m,  which  satisfy 

(48)  f  \x\2k  Pm(x)dx  <  — /  Pm{x)dx 

Jb "  m  JBn 

where  c  is  a  constant  independent  of  m. 

Using  polar  coordinates  and  the  change  of  variable  /  =  r2  it  is  clear  that 
it  suffices  to  construct  a  sequence  of  positive  univariate  polynomials  pm  such 
that 

(49)  /l  pm{t)lk+{n~2)l2dt  =  -%■  /l  Pm{t)t{n~2)/2dt 

Jo  mi  Jo 

where  pm  is  of  degree  no  greater  than  m  and  c  is  a  constant  independent  of 
m. 

The  construction  of  such  a  sequence  pm  is  outlined  as  follows: 


•  Let  N  be  the  integer  defined  by 


N  = 


(n  —  2)/2  if  n  is  even 
(n  —  l)/2  if  n  is  odd 


and  let  /?  =  (n  —  2)/2  —  jV.  Note  that  0  =  0  or  —  1  /2. 
•  Let  £  be  the  integer  defined  by 


£= k+N . 


•  Let  M  be  the  integer  defined  by 


M  = 


(£  +  m  +  2)/2  if  l  +  m  is  even 
{£  +  m  +  l)/2  if  £  +  m  is  odd. 
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•  Let  0  <  2,i/(l)  <  ~a/(2)  <  . . .  <  zm(M)  <  1  be  the  zeros  of  P^f'0\2t—  1) 
where  Pfu’^  is  the  classical  Jacoby  polynomial  of  degree  M  and  recall 
that  for  fixed  j 

(50)  zM{j)  <  Jp 

where  c  is  a  constant  independent  of  M .  See  [55]. 

•  Define 

(51)  Pm (0  =  FI  (*  ~  zmU))2 

j=Z+l 

and  observe  that  pm(t)  <A  and  pm(t)ik+N  are  polynomials  of  degree 
no  greater  than  2M  —  1  so  that  Gaussian  quadrature  with  knots  at 
zm{  1),  zm( 2), . . . ,  z,w(M)  can  be  used  to  evaluate  the  integrals  in  (49). 

•  Finally,  inequality  (49)  follows  from  the  following  strings  of  equali¬ 
ties/inequalities  and  (50): 

f  =  /'  Pm(t)iN*s  it  =  't\„U)rAH,U))‘MU)N 

Jo  Jo  jr] 

and 

/  Pm{t)tk+{n~2)l2dl  =  f  pm{t)tk+N+0dl  =  Yl^l(j)Pm{Z')ZMU)k+N 

Jo  Jo 

<  *m(0*  |lI^/(i)Pm(2j)zArO')N|  • 

The  properties  of  the  sequence  of  polynomials  just  constructed  together 
with  the  methods  of  Subsection  3.4  allow  us  to  conclude  the  following: 

Proposition  10  Suppose  Pm{x),  m  —  1,2,...,  is  the  family  of  positive  ra¬ 
dial  polynomials  defined  by  Pm(x)  =  pm(|z|2)  where  pm  is  the  univariate 
polynomial  given  by  (51).  Define 

etw  =  £(-i)^-  7 r4ri(t-)rw(t-)))  • 

j—0  J)J- 

Then  is  a  polynomial  of  degree  no  greater  than  2m  and  for  any  f  in 
LF(B”j2),  1  <  p  <  oo, 

||/^-/-/llw1V<cW;(/;l/m) 

where  c  is  a  constant  independent  of  f  and  m.  In  other  words,  for  fixed  k 
the  sequence  of  polynomials  P^,*  f,m—  1,2,...,  converges  to  f  in  Lp{B*j2) 
at  a  rate  which  is  0(uik(f ;  1/m))  as  m  — ►  oo. 
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4  Incomplete  data 

In  this  section  we  consider  summability  and  approximate  reconstruction 
when  the  data  fu  are  known  only  for  u’s  in  a  proper  subset  A  of  Sn_1.  Ap¬ 
plying  the  same  philosophy  used  in  the  previous  section  immediately  leads 
to  the  question  concerning  what  summability  kernels  K  can  be  expressed  as 
sums  of  ridge  functions  in  directions  u  which  lie  in  A.  It  is  to  be  expected 
that  the  class  of  kernels  which  enjoy  such  a  representation  is  somewhat  more 
restricted  than  in  the  case  of  full  data. 

In  the  case  when  A  is  a  finite  set  the  results  of  [37]  show  that  the  only 
radial  kernels  which  can  be  expressed  in  terms  of  the  corresponding  ridge 
functions  are  polynomials.  To  be  more  specific,  let  A  =  {«i,...,ujv}  and 
suppose  K  is  a  radial  function  such  that 

K(x)  =  £  A, ((*,«.» 

t=i 

for  appropriate  univariate  functions  /h, . . . ,  Ajv.  Then  K  must  be  a  polyno¬ 
mial  of  degree  no  greater  than  a  certain  bound  determined  by  A.  Further¬ 
more,  this  will  be  significant  in  our  considerations  below,  the  /i,’s  can  be  cho¬ 
sen  to  be  scalar  multiples  of  one  univariate  function  h,  namely,  h{(l)  =  W{h[t ), 
z  =  l,...,7V. 

Motivated  by  these  and  related  considerations  we  restrict  our  attention 
to  radial  polynomial  kernels  in  what  follows. 

4.1  Quadrature  on  the  sphere  and  ridge  function  representations 

Our  summability  formulas  are  based  on  the  following: 

Proposition  11  Suppose  A  is  a  measurable  subset  of  S’1*1  and  p  is  a  bounded 
Borel  measure  on  A.  The  following  are  equivalent: 

(i).  There  is  a  constant  c  so  that  the  formula 

(52)  /  p(u)dp(u)  =  c  /  p(u)dcr(u) 

Ja  Jsn~' 

holds  for  all  even  polynomials  of  degree  <  2m. 

( it ).  Formula  (52)  holds  for  all  homogeneous  polynomials  of  degree  =  2m. 
(in).  There  is  a  constant  c  so  that  the  formula 

(53)  f  (x,u)?k dp(u)  —  c  f  (x,u)2kdo(u) 

Ja  Jsn-> 

holds  for  all  x  when  k  =  m. 
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( iv ) .  Formula  (53)  holds  for  all  x  and  all  integers  k  which  satisfy  0  <  k  <  m. 

In  other  words 

(54)  f  p((x,  u))dfi(u)  —  c  f  p((x,u))dcr(u) 

JA  Jsn~] 

for  every  even  univariate  polynomial  p  of  degree  <  2m. 

(v) .  There  is  a  nonzero  constant  c  such  that 


and 

[  Y (u)dp(u)  =  0 
J A 

for  every  spherical  harmonic  Y  of  degree  2k,  k  =  1, . . . ,  m. 

(vi).  For  every  radial  polynomial  P  of  degree  <  2m  there  is  a  univariate 
polynomial  p  with  the  same  degree  as  P  such  that 

(55)  P(x)  =  J ^  p((x,  u))dp(u) 

holds  for  all  x.  Furthermore,  the  formula  for  p  in  terms  of  P  may  be 
derived  from 

(56)  P(x)  =  crn_x  J  i  p(|i|<)(l  -  Pfn~3)l7dt  . 

The  proof  of  this  proposition  is  contained  in  subsection  4.1.2.  Here  we 
merely  give  a  related  definition  which  will  be  useful  later. 

Motivated  by  the  above  proposition  we  say  that  a  subset  A  of  Sn~l  is 
in  Q(m)  if  there  is  a  bounded  measure  p  on  A,  which  depends  on  A  of 
course,  such  that  (52)  holds  for  all  polynomials  homogeneous  of  degree  2m, 
m  =  0,1,2, ...  .  In  other  words,  (2(m)  is  the  collection  of  all  measurable 
subsets  A  of  S"_1  which  admit  a  measure  p  for  which  ( ii )  of  Proposition  11 
holds.  For  convenience  we  set  Q(oo)  =  flm=o  Q{m)- 

The  significance  of  (3(77*)  in  our  application  is  the  fact  that  property  (ui) 
of  Proposition  11  holds  for  each  of  its  members. 

We  bring  to  the  reader’s  attention  the  following  list  of  transparent  obser¬ 
vations  concerning  Q(m ): 

•  Sn~l  is  contained  in  Q{m)  for  each  m,  m  =  0,1,2,...  ;  namely,  S"'1  € 
<2(oc). 

•  Q(0)  contains  every  non-empty  subset  of  S"-1. 

•  Q{m  +  1)  C  Q{m)  for  all  m. 

•  If  A  G  Q.{m)  and  B  D  A  then  B  €  Q(m). 

•  If  -4  is  in  Q(m)  then  the  measure  p  associated  with  A  whose  existence 
is  guaranteed  by  the  definition  need  not  be  unique. 
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4.1.1  Examples 

Suppose  A  is  an  open  subset  of  5"-1.  It  is  not  very  difficult  to  see  that 
A  €  Q{oc). 

Indeed,  to  see  this,  let  V{m)  be  the  space  of  polynomials  homogeneous 

of  degree  2m  and  let  {p,q)a  be  the  bilinear  form  defined  on  V{m)  via  the 

formula  _ 

(p,q)a  =  /  p{u)q(u)da(u)  . 

J  A 

Clearly  (p,q)A  is  a  positive  definite  bilinear  form  and  thus,  equipped  with 
this  form  as  an  inner  product,  V(m)  is  a  Hilbert  space.  Now  consider  the 
linear  form 

I  p(u)da(u) 

Jsn~l 

which  is  well  defined  on  V(m).  Elementary  Hilbert  space  theory  now  implies 
that  there  is  a  homogeneous  polynomial  of  degree  2m,  call  it  qm ,  such  that 

(P,  <lm)A  =  /  ,  p{u)dcx(u)  . 

Jsn~l 

We  summarize  these  observations  as  follows: 

Proposition  12  If  A  is  an  open  subset  of  Sn~l  then  A  €  Q(m)  for  every 
m,  m  =  0,  1,2,...  ;  in  other  words  A  €  (2(c>o).  Furthermore  the  measure 
p  which  does  the  job  for  a  given  m  can  be  taken  to  be  dp(u)  =  qm(u)da(u) 
where  qm  is  a  homogeneous  polynomial  of  degree  2m. 

In  the  case  when  T  is  a  finite  subset  of  Sn_1  ( v)  of  Proposition  1 1  can  be 
useful  in  determining  whether  T  is  in  Q(m). 

To  wit,  let  T  =  {«!, . . . ,  Uni}  then  any  measure  p  on  T  is  determined  by 
weights,  or  scalars,  wlt ,  wm  and 

M 

/  p(u)dp{u)  =  piu^Wj  . 

Jj:  i= i 

Now,  if  Ykt,  k  =  1, . . . ,  N(n,  (),  is  an  orthonormal  basis  for  the  space  of 
spherical  harmonics  on  S'*-1  which  are  homogeneous  of  degree  l  then  by 
virtue  of  (v)  of  Proposition  II  we  may  conclude  that  T  is  (2(m)  if  and  only 
if  the  system  of  equations 

M  M 

(57)  wi  ~  1  and  51  Yh((uj)Wj  =  0 

j=i  j=i 

where 

k  =  1, . . . ,  N (n,  £)  and  l  =  2, 4, . . . ,  2m 
has  a  solution  wit. w^. 
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We  remind  the  reader  that  this  system  has 


L(n,  2m.) 


(2m  +  n  —  1)! 
(2m)!(n  —  1)! 


equations.  However  the  number  of  elements  M  in  T  need  not  necessarily  be 
>  L(n,  2m)  in  order  for  a  solution  w i, . . . ,  w\j  of  (57)  to  exist.  For  example, 
in  the  case  n  =  2,  L( 2,  2m)  =  2m+  1;  but  if  Uj  =  (cos  9j,  sin  9j),  j  =  1, . . . ,  M, 
are  uniformly  spaced,  e.g.  9j  =  (j  —  \)~/M ,  then  it  suffices  to  have  M  > 
m  +  1  to  guarantee  the  existence  of  a  solution  for  (57).  This,  of  course,  is 
reminiscent  of  classical  Gaussian  quadrature. 

We  also  remark  that  in  the  case  n  =  2  any  collection  T  =  {iq, . . . ,  itju}  of 
distinct  directions  is  in  Q(m)  whenever  M  >  2m  +  1.  (By  distinct  directions 
here  we  mean  u,  ^  ±uj  whenever  i  ^  j.)  In  the  general  case  it  is  also 
true  that  most  finite  subsets  consisting  of  >  L(n,  m)  directions  are  in  Q(m). 
However,  determining  them  is  not  so  simple;  one  needs  to  verify  (57)  or  some 
other  equivalent  system. 

To  conclude  this  paragraph  we  summarize  the  main  result  concerning 
finite  subsets  of  Sn_1  as  follows: 


Proposition  13  Suppose  T  —  (iq, ...  ,«*/}  is  a  finite  subset  of  Sn  l.  Then 
T  is  in  Q(m )  if  and  only  if  the  system  of  equations  (57)  has  a  solution 


4.1.2  Proof  of  Proposition  11 

The  proposition  is  an  easy  consequence  of  the  following  strings  of  implica¬ 
tions:  (i)  =$►  (a)  =>  (Hi)  =>  (iv)  =>  (v)  =S>  ( i )  and  (iv)  =>  ( vi )  =>  ( iii ). 

To  wit,  (*)  =>  (a)  =>  (ni)  is  clear  since  each  condition  is  apparently 
weaker  than  its  predecessor.  Applying  the  Laplacian  in  x  to  (53)  in  the  case 
k  =  m  results  in  (53)  in  the  case  k  =  m  —  2  and  successive  applications  of 
the  Laplacian  show  that  (iii)  =>  (iv). 

To  see  that  (iv)  =>  (v),  recall  that  any  spherical  harmonic  of  degree  £  can 
be  expressed  as 

Y((u)  =  c(n,£)  [  Yi(v)C{tn~2)l2((v,u))d(T(v) 

Jsn-' 

where  Cf  is  the  Gegenbauer  polynomial  of  degree  £  with  respect  to  the  weight 
(1  -M2)a-1/2  and  c(n,£)  is  a  constant  which  depends  only  on  n  and  £.  Hence 
we  may  write 

(58)  Yt(u)dfi(u)  =  c(n,  £)  V>(i>)  ^  Ci"-2)/2((v,  u))dp(u)j  do(v)  . 
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Now,  if  l  —  2k,  k  =  1, . . . ,  m,  by  virtue  of  hypothesis  (zu)  the  inner  integral 
on  the  right  hand  side  of  (58)  can  be  replaced  with 

/  ,  C{2nk~2)l2((v,  u))dtr(u)  , 

which  is  equal  to  0  if  k  =  1 , . . . ,  m,  and  an  if  k  =  0.  This,  of  course,  implies 
the  desired  result. 

To  see  that  (u)  =>  (i),  recall  that  any  homogeneous  polynomial  p  of  degree 
2k  can  be  expressed  as 


p{x)  =  |z]2fc  ^a0  +  £*2j(x/M)J 

where  a0  is  a  constant  and  Y2j  is  a  spherical  harmonic  of  degree  2 j.  Hence 

(59)  /  p(u)da(u)  =  a0cr„ 

Jsn~ 1 

and,  by  virtue  of  ( v ),  there  is  a  constant  c  such  that 


(60) 


jf  p(u)dp(u)  =  a0ccrn  . 


A  comparison  of  (59)  with  (60)  and  a  linearity  argument  imply  the  desired 
result. 

Finally,  to  see  the  second  string  of  implications,  observe  that  (iu)  implies 
that  there  are  positive  constants  a0,...,am  such  that 


J^{x,u)7kdp{u)  =  ak  | 


l2fc 


for  k  =  0, . . . ,  m.  Hence  if  P  is  any  radial  polynomial  of  degree  <  2m,  say 
P(x)  =  oMxl2j>  then  (55)  must  hold  with  p(t)  =  ZjLo(^'/aj)<2j-  Thus 
(zu)  =>  (vi). 

Now,  if  (vi)  holds,  equating  the  highest  degree  terms  in  (55)  immediately 
implies  (m). 


4.2  Summability,  reconstruction,  and  degree  of  approximation 

Proposition  1 1  together  with  the  large  variety  of  available  polynomial  summa¬ 
bility  kernels,  some  of  which  were  described  in  Section  3.5,  allow  us  to  easily 
derive  summability  and  reconstruction  methods  for  Radon  transform  data  of 
the  form  {7 Z  f(u,t)  :  — oo  <  t  <  oo,  u  €  .4},  where  A  is  a  proper  subset  of 
5n_l  and  the  phantom  /  is  assumed  to  have  compact  support. 

The  following  are  examples  of  the  type  of  reconstruction  methods  which 
are  consequences  of  the  developments  outlined  above: 
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Proposition  14  Suppose  A  is  a  subset  of  Sn~ 1  which  is  in  Q(oc)  and  f  is 
an  integrable  function  with  support  in  B"j2.  Let  pm,  m  =  1,2,...,  be  the 
sequence  of  univai  iute  polynomials  defined  by 

mo = cj  (i  -  ^)("-3),%n-,(i  _  {isYrds . 

Let  /im  be  a  measure  such  that 

(  P(u)dpm(u)  =  f  P(u)da(u ) 

JA  JSn~l 

holds  for  all  polynomials  homogeneous  of  degree  2m.  Then 

(61)  f(x)  =  lim  —  /  pm*  fu((x,u))dpm(u) 

m— *°°  <Tn  JA 

for  almost  all  x  in  B"y2.  U>  addition,  f  is  in  Lp(B”y2)  for  some  p  satisfying 
1  <  p  <  oo  then 

lim  /  |/(x)  -  —  /  pm  *  /u((x,  u))dpm(u)\pdx  =  0  . 

m-°°  Jb"/2  crn  JA 

Furthermore,  if  f  is  uniformly  continuous,  (61)  holds  uniformly  in  x  for 
x  €  5"/2- 

Note  that  the  summability  formula  (61)  is  a  reconstruction  formula  for  /  in 
terms  of  the  data  {'R.f(u,  t)  :  — oc  <t<  oo,  u  €  ,4}. 

Recall  that  if  A  is  an  open  subset  of  5"_1  then  the  measure  in  the  last 
proposition  can  be  taken  to  be  dpm(u)  =  qm(u)d(r(u)  where  qm  is  a  poly¬ 
nomial  homogeneous  of  degree  2m.  Now,  if}*/,  k  =  1, . . . ,  N  (n,  £),  is  an 
orthonormal  basis  for  the  space  of  spherical  harmonics  on  5n_1  which  are 
homogeneous  of  degree  £  then  the  polynomial  qm  may  be  expressed  as 

N(n,t) 

(62)  qm(u)  =  a0  +  Y  Y  aktYu(u) 

t= 2,4 . 2m  k  =  \ 

as  a  function  on  the  unit  sphere  where,  by  virtue  of  (Hi)  of  Proposition  11 
and  the  construction  of  qm,  the  constants  a^t  may  be  calculated  by  solving 
the  system  of  equations 


,  "(**)  , 

(63) 

do  /  da(u)  -1-  Y  Y  Ukl  Yki(u)dcr(u)  =  1 

JA  1=2,. ..,2m  fc  =  l  JA 

and 

N(n,/) 

(64)  a0  / 

Yij(u)d(r(u)  +  Y  Y  au  1  5«j(u)l rki(u)da(u)  = 

J  A 

1  1=2,..., 2m  k=l  JA 

where  i  =  1, 

. . . ,  N(n,  j)  and  j  =  2,4,...,  2m. 
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Proposition  15  If  A  is  an  open  subset  of  Sn~l  then  the  conclusions  of 
Proposition  lJt  hold  with  dpmiu)  =  qm(u)da(u)  where  qm  is  a  polynomial 
defined  by  (62),  (63),  end  (64)- 

Typical  results  concerning  degree  of  approximation  are  the  following: 

Proposition  16  Suppose  A  is  in  (2(ro)  and  f  is  in  Lp{Bff)  for  some  p, 
1  <•  P  Si  30 •  Let  p  be  a  measure  such  that 

[  P(u)dp(u)  =  c  (  P(u)do(u) 

Ja  Js n-‘ 

holds  for  all  polynomials  homogeneous  of  degree  2m.  Then  there  is  a  uni¬ 
variate  polynomial  Q  of  degree  no  greater  than  2m  such  that 

(65)  ||/(x)  ~  jf  Q  *  /„((x,  u))dp(u) ||Lp(x€b1"/j)  <  cup{f,  1/m) 

where  top  is  the  Lp  modulus  of  continuity  of  f  and  c  is  a  constant  independent 
of  A,  f ,  and  m.  More  generally,  given  any  positive  integer  k,  there  is  a 
univariate  polynomial  Q  of  degree  no  greater  than  2m  such  that 

(66)  ||/(:r)  ~  JaQ*  /u«x,  u))d/r(u)||LP(xeBnj)  <  cukp{f,  1/m) 

where  ujp  is  the  k-th  order  Lp  modulus  of  smoothness  of  f  and  c  is  a  constant 
independent  of  A,  f,  and  m. 

Polynomials  Q  which  do  the  job  in  the  above  proposition  can  be  constructed 
via  the  methods  of  Subsection  3.3  combined  with  those  of  Subsection  3.5. 
For  example,  given  an  integer  k  let  be  the  polynomial  defined  in  the 
statement  of  Proposition  10.  If  pm(|x|)  =  P*(z)  then  the  polynomial  Q 
defined  by 

does  the  job  in  (66). 

Inequalities  (65)  and  (66)  can  be  viewed  as  approximate  reconstruction 
formulas  for  /  in  terms  of  the  data  { Tlf(u ,  t )  :  -00  <  t  <  00,  u  €  A}.  Note 
that  the  error  bound  holds  for  all  and  depends  only  on  /  and 

m. 

It  should  be  mentioned  that  given  data  of  the  form  {7v/(u, /)  :  —00  < 
t  <  00,  u  <E  A },  where  A  is  in  Q(m)  it  is  not  possible  to  obtain  approximate 
reconstruction  algorithms  which  will  give  better  degree  of  approximation 
than  that  given  by  (66).  Thus  the  result  in  Proposition  16  is  best  possible 
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in  the  following  sense:  There  are  examples  of  ^4  in  Q.{m)  and  functions  /  in 
Ip(fi”/2)  such  that  fu  =  0  for  all  u  in  A  and 

\\f\\L-H(f>  1/H  >  e 

where  c  is  a  constant  independent  of  m. 

The  approximate  reconstruction  formulas  described  above  rely  on  the 
fact  that  a  radial  polynomial  P(x)  of  degree  <  2m  enjoys  the  ridge  function 
representation  described  by  (55)  whenever  A  is  in  Q(m).  It  is  clear  that 
the  corresponding  ridge  function  Q(t )  can  be  obtained  from  (10)  or  one  of 
its  variants.  However,  for  numerical  work  it  is  usually  more  convenient  to 
obtain  a  representation  of  P  in  terms  of  orthogonal  polynomials  of  the  form 

P(x)  =  f;6JPjA'(n-2)/2)(2|x|2-l) 
i=o 

for  some  A  >  —  1.  Then  use  (34)  to  obtain 

m 

<?W  =  E  hCi!n'\t) 

j= o 

which  can  be  efficiently  evaluated  via  the  three  term  recurrence  relation. 
Explicit  numerical  examples  will  appear  elsewhere. 

5  Miscellaneous  rerxarks 

Introductory  material  concerning  the  Radon  transform  and  its  applications 
can  be  found  in  [8,  16,  27,  41,  47]  and  the  pertinent  references  cited  there. 
Our  use  of  the  term  ridge  function  is  adopted  from  [23]. 

The  notion  of  representing  kernels,  K,  in  terms  of  ridge  functions  to 
obtain  approximate  reconstruction  formulas  has  been  used  for  quite  some 
time  under  various  guises,  see  [4,  7,  23,  47].  For  example,  the  so-called  filtered 
back projection  method  is  a  concrete  and  numerically  useful  manifestation  of 
it,  see  [47,  48,  49,  16,  41]. 

It  should  be  clear  that  in  order  to  express  a  convolution  K*  f(x)  in  terms 
of  the  full  Radon  transform  data  of  /  it  suffices  to  express  K  as  a  sum  of 
ridge  functions.  For  example, 

(67)  K(x)  =  f  h(u,  (x,  u))da(u) 

Js  n-i 

is  such  an  expression;  here  the  ridge  functions  vary  from  direction  to  direc¬ 
tion.  Representation  (67)  holds  for  most  reasonable  functions,  at  least  in 
some  wide  sense,  with  h  given  by 

(68)  h(u,  (x,  ti»  =  (27r)~n/2  f°°  K{ru)ei{x'u)Trn-1dr 
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where  K  is  the  n  variate  Fourier  transform  of  K.  Note  that  the  pair  (G7) 
and  (68)  is  simply  a  variant  of  the  Fourier  inversion  formula.  The  restric¬ 
tion  to  uniform  sums  allows  for  an  elementary  development  of  the  desired 
summability  formulas. 

The  results  in  Sections  2. 1-3.2,  restricted  to  the  case  n  =  2,  together  with 
related  material  and  more  examples  were  recorded  and  circulated  in  [34];  the 
key  ideas  were  announced  in  [35].  Some  of  this  material  was  published  using 
different  notation  in  [12,  13];  1  gratefully  acknowledge  the  referee  for  pointing 
out  these  references  and  making  other  constructive  comments. 

Radial  kernels  have  been  used  in  tomographic  applications  by  other  au¬ 
thors,  most  notably  [20,  50],  However,  the  application,  perspective,  and 
basic  results  in  [20,  50]  are  significantly  different  from  ours.  For  example, 
the  formula  for  h  in  terms  of  K  is  based  on,  in  our  notation, 

■R{K~f){u,i)  =  A'„*/»(0 

and 

f{x )  =  A"-1  I  R  f{u,  <x,  u))d(r{u ) 

to  get 

(69)  h{t)  =  or  \n~lHK(u,t) 

where  A”-1  is  the  pseudo-differential  operator  whose  symbol  is  a  constant 
multiple  of  |£|n_1.  Note  the  similarity  between  (69)  and  (28).  Formula  (69) 
appears  to  require  conditions  on  K  which  are  more  elaborate  than  those  in 
Proposition  3  in  order  to  make  sense  pointwise.  Furthermore,  the  derivation 
of  (69)  uses  an  inversion  formula  whereas  our  development  leads  to  various 
inversion  formulas.  I  gratefully  acknowledge  Don  Solmon  who  brought  my 
attention  to  these  matters  and  suggested  the  comments  in  this  paragraph. 

The  uniform  ridge  function  representation  of  the  bivariate  Poisson  kernel 
can  be  found  in  [47].  Additional  examples  of  uniform  ridge  function  repre¬ 
sentations  may  be  found  in  [20,  34,  50], 

Observe  that  (5)  may  be  regarded  as  a  variant  of 

(■&■/!)  *  /  =  R:{h*nf) 

where  TV  is  the  formal  adjoint  of  71.  In  this  formulath  n  it  has  been  suc¬ 
cessfully  applied  in  other  more  general  contexts  of  integial  geometry,  see 
[2,3,14]. 

Various  inversion  formulas  are  well  known  for  full  Radon  transform  data, 
see  [41]  and  the  notes  and  references  cited  there;  for  example,  [8,  15,  16,  17, 
20,  47,  50,  51]. 

Polynomials  arise  quite  naturally  in  the  study  of  the  Radon  transform, 
see  [41]  and  the  notes  and  references  cited  there.  For  example  they  occur  in 
in  singular  value  decompositions  [7,  24,  28,  41,  44],  in  optimal  reconstruction 


29 


in  the  L 2  sense  [23],  and  in  many  other  contexts  [5.  8.  2T  30.  31.  32.  33,  36, 
37,  39,  41].  We  note  that  the  results  of  [33]  show  that  the  output  of  any 
reconstruction  algorithm  which  uses  Radon  transform  data  discretized  in  the 
angular  variable  and  is  both  rotation  and  translation  invariant  in  a  certain 
mild  physically  meaningful  sense  must  be  the  convolution  of  the  phantom 
with  a  polynomial.  The  development  in  Section  3.3  is  simply  an  extension 
of  some  of  the  results  in  [32]. 

Moduli  of  smoothness  provide  a  natural  and  useful  measure  of  the  de¬ 
gree  of  approximation  for  various  processes.  For  classical  results  see  [12,  56]. 
These  moduli  can  be  used  to  define  precise  and  meaningful  notions  of  reso¬ 
lution  for  very  wide  classes  of  approximates.  We  will  not  go  into  the  details 
here.  However  we  do  mention  that  if  Q  is  a  sufficiently  well  behaved  set  in 
Rn,  such  as  a  ball  or  parallelepiped,  then  the  indicator  function  of  fi,  call  it 
X  satisfies 

wp(xt)  =  0(  f1/p) 

as  e  goes  to  0.  Hence  error  estimates  in  terms  of  this  modulus  are  0(c1^p)  as 
e  — ►  0.  In  the  case  p  =  2  this  should  be  compared  with  estimates  in  terms  of 
Sobolev  space  type  norms  for  such  phantoms  which  are  0(e/3),  /3  <  1/2;  in 
particular,  see  the  introductory  discussion  in  Section  IV. 2  of  [41]. 

Polynomial  approximation  is  a  classical  subject,  see  [12,  56].  For  details 
and  various  formulas  concerning  classical  orthogonal  polynomials,  including 
Gaussian  quadrature,  see  [11,  18,  55].  The  polynomials  Pm,  m  =  1,2,..., 
described  in  Subsection  3.5.2  were  constructed  in  the  case  n  =  2  and  k  =  1 
in  [29,  30];  they  are  modifications  of  polynomials  considered  in  [12]  and  used 
for  similar  purposes.  Other  related  constructions  can  be  found  in  [31]. 

For  other  approaches  to  the  limited  angle  problem  see  [7,  41,  46]  and 
the  pertinent  references  cited  in  these  works.  For  other  types  of  restrictions 
see  [9,  21,  39,  41,  45]  and  the  pertinent  references  cited  there.  The  results  in 
Section  4.1  are  extensions  of  some  of  the  results  in  [32,  37].  The  notion  that  a 
set  is  in  O(m)  appears  to  be  related  to  the  notion  of  m-resolving;  see  Section 
III. 2  in  [41].  For  results  concerning  mechanical  quadrature  on  spheres  see 
[13,  41,  40,  54],  For  properties  of  spherical  harmonics  see  [11,  18,  53]. 

The  results  in  Section  4.2  are  an  extension  and  improvement  of  some 
of  the  work  recorded  in  [31]  and  announced  in  [32].  These  estimates  can 
be  used  to  obtain  acceptable  bounds  on  errors  due  to  discretization  of  the 
Radon  transform  in  the  angular  variables  but  not  in  the  l  variable.  For  such 
bounds  which  take  discretization  in  both  types  of  variables  into  account  but 
use  Sobolev  type  norms  see  [41,  Section  IV. 2]  and  the  pertinent  references 
cited  there;  note  however  the  restrictions  on  the  various  parameters  in  these 
results.  For  bounds  on  resolution  from  another  point  of  view  see  [22]. 

The  degree  of  approximation  or  resolution  from  limited  angle  data  de¬ 
pends  only  on  the  parameter  m  and  not  on  any  other  properties  of  the  set 
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A.  This  is  not  the  case  when  conditionedness  of  the  problem  is  taken  into 
account.  For  estimates  on  the  accuracy  of  the  data  required  for  a  given  reso¬ 
lution  in  the  reconstruction  see  [38];  for  results  related  to  resolution  and  the 
size  of  singular  values  see  [6,  26,  28,  41]  and  the  pertinent  references  in  these 
works. 
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Bounds  on  Multivariate  Polynomials  and 
Exponential  Error  Estimates  for 
Multiquadric  Interpolation 

W.  R.  Madych*  S.  A.  Nelson1 


Abstract 

A  class  of  multivariate  scattered  data  interpolation  methods  which 
includes  the  so-called  multiquadrics  is  considered.  Pointwise  error 
bounds  are  given  in  terms  of  several  parameters  including  a  parameter 
d  which,  roughly  speaking,  measures  the  spacing  of  the  points  at  which 
interpolation  occurs.  In  the  multiquadric  case  these  estimates  are 
0{\l!d)  as  d  0  where  A  is  a  constant  which  satisfies  0  <  A  <  1.  An 
essential  ingredient  in  this  development  which  may  be  of  independent 
interest  is  a  bound  on  the  s'ze  of  a  polynomial  over  a  cube  in  Rn 
in  terms  of  its  values  n  a  discrete  subset  which  is  scattered  in  a 
sufficiently  uniform  me  .ier. 

1  Introduction 

Let  h  be  a  continuous  function  on  Rn  which  is  conditionally  positive  definite 
of  order  m.  Given  data  (xj,  fj),  j  =  1 N,  where  .Y  =  pi, . . . ,  x^}  is  a 
subset  of  points  in  Rn  and  the  /j’s  are  real  or  complex  numbers,  the  so-called 
h  spline  interpolant  of  this  data  is  the  function  s  defined  by 

N 

(1)  s(x)  =  P(x)  +  E  CMX  ~  Xj) 

J=1 

where  p(x)  is  a  polynomial  in  Vm-\  and  the  c/s  are  chosen  so  that 

(2)  Yl  cA*j)  =  0 

r=i 
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for  all  polynomials  q  in  Vm-i  and 

N 

(3)  p(xi)  +  Y,  cMxi  ~  xi )  =  /f.  *  =  1.  •  •  • ,  N. 

j=i 

Here  Vm-\  denotes  the  class  of  those  polynomials  on  Rn  of  degree  <  m  —  1. 

It  is  well  known  that  the  system  of  equations  (2)  and  (3)  has  a  unique 
solution  when  A'  is  a  determining  set  for  Vm-i  and  h  is  strictly  conditionally 
positive  definite.  Thus,  in  this  case,  the  interpolant  s(z)  is  well  defined. 

We  remind  the  reader  that  AT  is  said  to  be  a  determining  set  for  Vm-i  if 
p  is  in  Vm- 1  and  p  vanishes  on  X  implies  that  p  is  identically  zero. 

If  h  is  the  function  defined  by  the  formula 


where  |z|  is  the  Euclidean  norm  of  x  then  m  =  1  and  corresponding  method 
of  interpolation  defined  by  (1),  (2),  (3)  and  (4)  is  often  referred  to  as  the 
multiquadric  method.  This  and  closely  related  methods  are  currently  quite 
fashionable,  see  [3,  8] 

In  an  earlier  paper  [6]  we  obtained  bounds  on  the  pointwise  difference 
between  a  function  /  and  the  h  spline  which  agrees  with  /  on  a  finite  subset 
A'  of  Rn.  These  estimates  involve  a  parameter  d  that  measures  the  spacing  of 
the  points  in  X  and  are  0{dl )  as  d  — *  0  where  l  depends  on  h.  The  results  of 
the  present  paper  imply  that  for  certain  h%  which  include  (4),  the  estimates 
can  be  improved  to  0{\l!d)  as  d  — *  0  where  A  is  a  constant  which  satisfies 
0  <  A  <  1.  The  conditions  on  /  are  are  same  as  those  in  [6]. 

1.1  A  bound  for  multivariate  polynomials 

A  key  ingredient  in  the  development  of  our  estimates  is  the  following  lemma 
which  gives  a  bound  on  the  size  of  a  polynomial  on  a  cube  in  Rn  in  terms 
of  its  values  on  a  discrete  subset  which  is  scattered  in  a  sufficiently  uniform 
manner.  This  result  may  be  of  independent  interest. 

Lemma  1  For  n  =  1,2,...,  define  7„  by  the  formulas  7i  =  2  and,  if  n  >  1, 
7„  =  2n(l  4-  7n— i  )  •  Let  Q  be  a  cube  in  Rn  that  is  subdivided  into  qn  identical 
subcubes.  Let  Y  be  a  set  of  qn  points  obtained  by  selecting  a  point  from  each 
of  those  subcubes.  If  q  >  7 n(k  +  1),  then  for  all  p  in  Vk 

sup]p(z)|  <  e2qn  sup  |p(y)|  . 

z€Q  vGY 

We  remark  that  it  is  not  essential  for  the  set  Y  to  intersect  every  subcube 
of  Q  as  hypothesized  above.  A  variant  of  this  lemma  where  Y  intersects  a 
certain  percentage  of  these  subcubes  can  be  found  in  subsection  3.3. 
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1.2  A  variational  framework  for  interpolation 

The  precise  sta  tement  of  our  estimates  concerning  h  splines  requires  a  certain 
amount  of  technical  notation  and  terminology  which  are  identical  to  that  used 
in  [6].  For  the  convenience  of  the  reader  we  recall  several  basic  notions. 

The  space  of  complex  valued  functions  on  Rn  that  are  compactly  sup¬ 
ported  and  infinitely  differentiable  is  denoted  by  V.  The  Fourier  transform 
of  a  function  <p  in  V  is 

«£(£)  =  J 

A  continuous  function  h  is  conditionally  positive  definite  of  order  m  if 

(5)  J  h(x)<p  *  <p(x)dx  >  0 

holds  whenever  <p  =  p(D)ip  with  ip  in  V  and  p(D)  a  linear  homogeneous 
constant  coefficient  differential  operator  of  order  m.  Here  <j>(x)  =  <p{—x)  and 
*  denotes  the  convolution  product 

<f> l  *  <M0  =  J  -  x)dx. 

Note  that  (5)  can  be  rewritten  as 

j  J  Kx  -  y)<p{x)<p(y)dxdy  >  0. 

In  what  follows  h  will  always  denote  a  continuous  conditionally  positive 
definite  function  of  order  m.  The  Fourier  transform  of  such  distributions 
uniquely  determines  a  positive  Borel  measure  p  on  Rn  ~  {0}  and  constants 
a-,,  M  =  2m  as  follows:  For  all  ip  G  V 

(6) 

f  h(x)ip(x)dx  =  f  lip(£)  -  X(0  Z?7V’(°)^7  }  dp(0  +  H  D'lipiO)^ 

'  \  h|<2m  J  M<2m 

where  for  every  choice  of  complex  numbers  ca,  |a|  =  m, 

(7)  X]  aa+pca^P  ^  0. 

|a|=m  \P\=m 

Here  \  >s  3  function  in  V  such  that  1  —  x(0  has  a  zero  of  order  2m  +  1 
at  £  =  0;  both  of  the  integrals  /0< |<i  |£|2md//(f),  dp{0  are  finite.  The 
choice  of  \  affects  the  value  of  the  coefficients  a7  for  |-y|  <  2m. 

If 

Vm  =  |<^  €  V  :  J  xa(p(x)dx  =  0  for  all  |aj  <  mj 
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then  Ch,m  is  the  class  of  those  continuous  functions  /  which  satisfy 


(8) 


J  f(x)<f>{x)dx  <  c(f )  {/ h(x  -  y)4>(x)<j>{y)dxdy J 


1/2 


for  some  constant  c(f)  and  all  <f>  in  Vm.  If  /  G  Ch,m  let  \\f\\h  denote  the 
smallest  constant  c(f)  for  which  (8)  is  true.  Recall  that  \\f\\h  is  a  semi-norm 
and  Ch,m  is  a  semi  Hilbert  space;  in  the  case  m  =  0  it  is  a  norm  and  a  Hilbert 
space  respectively.  Elements  /  in  Ch,m  are  of  the  form 


/  —  /i  +  h 


where  the  Fourier  transform  of  f\  is  given  by 

/i(0  =  diOMO 

with  g  in  L2(dp)  and  fz  is  a  polynomial  of  degree  m. 

Given  a  function  /  in  Ch,m ,  there  is  an  element  s  of  minimal  Ch,m  norm 
which  is  equal  to  /  on  X.  If  A*  is  a  determining  set  for  Vm~i  then  s  is  unique. 
We  refer  to  such  s  as  the  h  spline  interpolant  of  /  on  X. 

In  the  case  when  X  is  a  finite  subset  of  Rn  as  considered  in  beginning  of 
this  introduction  the  h  spline  s  is  given  by  (1),  where  /(x.)  =  /,,  t  =  1, . . . ,  N, 


1.3  Exponential  error  estimates 

Our  basic  theorem  concerns  how  well  s  approximates  /  in  regions  f?  where 
X  provides  sufficient  coverage.  In  other  words,  we  are  interested  in  bounds 
on  the  quantity 

1  f(x)  -  fOOj 
ii/IU 

where  x  is  in  Q;  the  estimates  should  be  in  terms  of  parameters  which  measure 
how  closely  X  covers  Q.  For  example,  the  parameter  d  =  d(Q,  X)  defined  by 


d(fl,  X)  =  sup  inf  | y  —  x\ 
yen  *€.Y 


is  one  such  measure. 

In  [6]  we  showed  that  in  many  cases  the  quantity  in  (9)  is  0(dk)  as  d  — *  0 
where  A:  is  a  constant  whose  maximum  value  is  determined  by  h.  In  this  paper 
we  restrict  our  attention  to  h’s  whose  corresponding  measures  n  defined  by 
(6)  satisfy  certain  moment  conditions.  For  example,  if  h  is  given  by  (4)  there 
is  a  positive  constant  p  such  that  for  all  integers  k  greater  than  2 

(10)  J  |£|*<M0  <  phk\ . 
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In  this  case  we  are  able  to  obtain  the  exponential  estimate  described  in  the 
abstract. 

In  subsection  2.3  we  consider  a  variant  of  (10)  where  kl  is  replaced  by  krk, 
r  an  arbitrary  real  constant.  As  might  be  expected,  this  leads  to  somewhat 
different  bounds  on  (9). 

Because  of  the  local  nature  of  the  result,  we  restrict  our  attention  to  the 
case  where  fi  is  a  cube. 

Theorem  1  Suppose  h  is  conditionally  positive  definite  of  order  m  and  the 
corresponding  measure  fi  satisfies  (10)  for  all  k  greater  than  2m.  Then,  given 
a  positive  number  bo,  there  are  positive  constants  6q  and  A,  0  <  A  <  1,  which 
depend  on  bQ  and  h  for  which  the  following  is  true:  If  f  €  Ch>rn  and  s  is  the 
h  spline  that  interpolates  f  on  X  then 

l/to  -  *(*)l  <  A'^H/lU 

holds  for  all  x  in  a  cube  E  provided  that  (i)  E  has  side  b  and  b  >  b0,  (ii) 
0  <  8  <  60,  and  (Hi)  every  subcube  of  E  of  side  6  contains  a  point  of  X . 

Observe  that  every  cube  of  side  6  contains  a  ball  of  radius  6/2.  Thus  the 
subcube  condition  is  satisfied  when  6  =  2 d(E,  X).  More  generally,  we  can 
easily  conclude  the  following: 

Corollary  1  Suppose  h  satisfies  the  hypotheses  of  the  Theorem,  fi  is  a  set 
which  can  be  expressed  as  the  union  of  rotations  and  translations  of  a  fixed 
cube,  and  X  is  a  subset  of  Rn .  Then  there  are  positive  constants  do  and  A, 
0  <  A  <  0,  which  depend  on  60  and  h  for  which  the  following  is  true:  If 
d  <  do,  }  6  Ch,m  an(I  s  Is  the  h  spline  that  interpolates  f  on  X  then 

l/W  -  »MI  <  *‘"11/11* 

holds  for  all  x  in  fl  where  d  —  d{ fi,  A  ). 

Note  that  any  ball  in  Rn  satisfies  the  hypothesis  on  fi  in  the  above  corol¬ 
lary.  Indeed,  any  set  fi  with  sufficiently  smooth  boundary  satisfies  this  hy¬ 
pothesis. 

2  Details  for  Theorem  1,  examples,  and  generalizations 

As  alluded  to  in  the  introduction,  Lemma  1  is  an  important  ingredient  in 
the  proof  of  this  theorem.  The  following  lemma,  which  is  a  transparent 
consequence  of  Lemma  1  and  routine  arguments  involving  linear  functionals, 
is  in  convenient  form  for  applying  this  ingredient. 
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Lemma  2  Let  Q,  Y ,  and  7„  be  as  in  Lemma  1.  Then,  given  a  point  x  in 
Q,  there  is  a  measure  a  supported  on  Y  such  that 

f  p(y)d<r(y)  =  p(x) 

for  all  p  in  Vk,  and 

J  d\cr\(y)  <  e2^”(fc+1)  . 

2.1  Proof  of  Theorem  1 

First,  let  p ,  7„,  and  b0  be  the  constants  appearing  in  inequality  (9),  Lemma 
1,  and  Theorem  1  respectively.  Let 

B  =  2 pi/ne2”'1'"  and  C  =  max  Ib,  -7-}  . 

I  36q  J 

Let 

c  1 

0  3Cy„(m  +  1)  ’ 

where  m  is  the  order  of  conditional  positive  definiteness  of  h.  We  will  show 
that  60  as  defined  above  can  be  used  for  the  constant  in  the  statement  of 
Theorem  1. 

For  now,  let  x  be  any  point  of  the  cube  E  and  recall  that  Theorem  4.2  of 
[6]  implies  that 

(1 !)  I f(x)  ~  «(*)|  <  ck\\f\\k  j\y~  x\kdcr{y) 

whenever  k  >  m  where  a  is  any  compactly  supported  measure  such  that 
(12)  J  p{y)dcr{y)  =  p{x) 

for  all  polynomials  p  in  Vk- 1-  Here 


whenever  k  >  m  and  by  virtue  of  (9) 

(13)  c„  <  (2p)*  . 


/2 


To  obtain  the  desired  bound  on  |/(x)  —  s(x) |  it  suffices  to  find  a  suitable 
bound  for 

I  =  ck  J  \y~  x\ kd<r(y)  . 

This  is  done  by  choosing  the  measure  a  appropriately.  We  proceed  as  follows: 
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Let  8  be  a  parameter  as  in  the  statement  of  the  Theorem.  Since  8  <  S0 
we  may  chose  an  integer  k  so  that 

(14)  1  <  3Cj nk6  <  2  . 

Note  that  such  a  k  is  >  m  -f  1  and  7 nk8  <  b0.  Let  Q  be  any  cube  which 
contains  x,  has  side  7 nk8,  and  is  contained  in  E.  Subdivide  Q  into  {~fnk)n 
congruent  subcubes  of  side  8.  Since  each  of  these  subcubes  must  contain  a 
point  of  A',  select  a  point  of  A'  from  each  such  subcube  and  call  the  resulting 
discrete  set  Y.  By  virtue  of  Lemma  1  we  may  conclude  that  there  is  a 
measure  0  supported  on  Y  which  satisfies  (12)  and  enjoys  the  estimate 

(15)  J  d\0\(y)  <  e2n'1nk  . 

We  use  this  measure  in  (11)  to  obtain  an  estimate  on  /. 

Using  (13),  (15),  and  the  fact  that  support  of  cr  is  contained  in  Q  whose 
diameter  is  ^/n^kS  we  may  write 

(16)  I  <  (2p)k{^lnk8)ktM  <  ( Clnk8)k  . 

Since 

(C^)  ^  f  and  t  > 
inequality  (16)  implies  that 

I  <  ((2/3)1/(SC7n))1/4  . 

Hence  we  may  conclude  that 

l/(*)-»WI<*,"ll/IU 

where 

A  =  (2/3)1/(3C7n)  . 


2.2  Examples 

A  well  known  class  of  examples  of  conditionally  positive  definite  h’s  is  given 
by  , 

h(x)  =  — — 

[)  (1  T  jr|2)a/2 

where  a  is  a  fixed  real  number  ^  0,  —2,  —4, . . .  and  j  is  the  classical  gamma 
function.  The  corresponding  measure  p  is  given  by 

mo  = 
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where  ca  is  a  positive  constant  and  Kv  is  a  modified  Bessel  function  of  the 
second  kind;  see  [6]  for  more  details  and  the  cases  a  =  0,  —2,  —4,  —  Because 
of  the  exponential  decay  of  Kv(t)  as  t  — *  c©  the  moments  of  of  p  grow  like 
pkk\  and  hence  p  satisfies  (9)  whenever  k  is  sufficiently  large. 

The  important  example  of  the  Gaussian 

h(x)  =  e"Ix|i 

has  corresponding  measure 

dp{£)  =  (2tt  )n'Vlfl’/4<iC 

of  course.  The  moments  of  p  grow  like  pk\/ki .  Although  Theorem  1  provides 
a  bound  on  the  error,  in  this  case  one  expects  better  estimates  because  of 
the  significantly  slower  than  hypothesized  growth  of  these  moments. 

More  generally,  consider  the  case  when  the  measure  fi  is  given  by 

dfi(£)  =  e-^  d£ 

where  a  is  a  positive  constant.  Here,  of  course, 

h(x)  =  J  . 

The  moments  of  fi  grow  like  pkkrk  where  r  =  1/a.  The  case  a  =  2  is 
essentially  the  Gausssian  which  together  with  the  rest  of  the  cases  a  >  1  is 
covered  by  Theorem  1.  On  the  other  hand  if  0  <  a  <  1  the  bcfund  on  the 
rate  of  growth  of  the  moments  hypothesized  in  the  statement  of  Theorem  1 
fails  to  hold. 

The  theorems  in  Subsection  2.3  provide  answers  to  the  questions  raised 
above. 

2.3  Generalizations 

As  mentioned  in  the  introduction,  different  bounds  on  the  rate  of  growth  of 
the  moments  of  the  measure  p  result  in  different  estimates  on  the  difference 
between  /  and  its  h  spline  interpolant  s  off  the  interpolated  set.  Here  we 
consider  the  case 

(17)  /  <  p‘*r‘ 

for  k  >  2m,  where  r  is  a  real  constants  and  p  is  a  positive  constant. 

Note  that  in  view  of  Stirling’s  formula  there  are  positive  constants  px  and 
p2  so  that 

(18)  pkkk  <k\<  pkkk  . 

Thus  the  case  r  =  1  was  treated  in  Theorem  1.  Also  observe  that  Theorem 
1  provides  an  estimate  in  the  case  r  <  1.  However  it  is  possible  to  get  a 
more  sensitive  estimate  in  this  case  without  much  more  work;  this  is  shown 
in  Theorem  3  and  its  proof.  We  first  consider  the  case  r  >  1. 
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Theorem  2  Suppose  h  is  conditionally  positive  definite  of  order  m  and  the 
corresponding  measure  p  satisfies  (11)  with  r  >  1  for  all  k  greater  than 
2m.  Then,  given  a  positive  number  bo,  there  are  positive  constants  So  and  X, 
0  <  A  <  1,  which  depend  on  h,  r,  and  bo  and  for  which  the  following  is  true: 
If  f  €  Chtm  and  s  is  the  h  spline  that  interpolates  f  on  X  then 

I /(*)  -  *(*)l  <  A4~1/r||/|U 

holds  for  all  x  in  a  cube  E  provided  that  (i)  E  has  side  b  and  b  >  60,  (ii) 
0  <  8  <  Sq,  and  (Hi)  every  subcube  of  E  of  side  S  contains  a  point  of  X . 

Proof  In  view  of  (17)  and  (18)  there  is  a  constant  p0  such  that 

/  If  wo  <  ft#’-1*  ■ 

Let  7„  and  60  be  the  constants  appearing  in  the  statements  of  Lemma  1  and 
Theorem  2  respectively.  Let 

B  =  2po'Jne2n'1n  and  C  =  max  {  B,  -7-}  . 

I  36n  J 

and  let 

8  1 
°  3rC7„(m  +  l)r  ’ 

where  m  is  the  order  of  conditional  positive  definiteness  of  h.  Let  S  be  a 
parameter  as  in  the  statement  of  the  theorem.  Since  8  <  80  so  that  3(Ci„8)llT 
is  less  than  1  and  we  may  choose  an  integer  k  so  that 

1  <  3(C7„5)1/r*  <  2  . 

Note  that  such  a  k  is  >  m  +  1  and  i„k6  <  b0. 

Proceeding  as  in  the  proof  of  Theorem  1  we  get 

(19)  l/(i)  -  *WI  <  Will 

where 

/  <  pk0k^k{^lnk6)ke2n-1nk  <  (( C-fn6y/rk)Tk  . 

Since 

(Cyn6ylTk  <  -  and  k>  •- — — 

V  '  ’  ~  3  -  3(CjnSyiT 

we  may  conclude  that 

/<  ((2/3)1/(3(c^’/r))6'"/r  . 

In  view  of  (19)  the  theorem  now  follows  with 

A  =  (2/3 )lIWci”V,r)  . 
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Theorem  3  Suppose  h  is  conditionally  positive  definite  of  order  m  and  the 
corresponding  measure  p  satisfies  (17)  with  r  <  1  for  all  k  greater  than  2m. 
Then,  given  a  positive  number  b0,  there  are  positive  constants  So,  c,  and  C, 
which  depend  on  h,  r,  and  b0  and  for  which  the  following  is  true:  If  f  €  Ch 
and  s  is  the  h  spline  that  interpolates  f  on  X  then 

i/w  -  «toi  <  («r,1ii/iu 

holds  for  all  x  in  a  cube  E  provided  that  (i)  E  has  side  b  and  b  >  b0 ,  ( ti ) 
0  <  6  <  6q,  and  (Hi)  every  subcube  of  E  of  side  6  contains  a  point  of  X . 

Proof  Let 

1 _ b0  } 

(Bbroyi('~'hn'2-,n(m  +  l)j 

where  7„  is  the  constant  defined  in  Lemma  1,  and 

B  =  po \fne2n~1n 


with  po  as  in  the  proof  of  Theorem  2  Then  if  8  <  S0  there  is  an  integer  k  so 
that 

Y  <  7n hk  <  bo  . 

Arguing  as  in  the  proof  of  Theorem  2  we  we  can  conclude  that 
(20)  |/(x)  -  s(x)\  <  /||/|U 

where  I  <  ( B~fnbkT)k  Since  k  <  6o/(7n^)  we  may  write 


'  £  1  B'J  [te 


and  since  B7*  Tbr061~T  <  1  and  k  >  b0/(2~fnb )  it  follows  that 

/  <  (S7^r6'6l"r)6o/(27n4)  • 

The  last  inequality  together  with  (20)  imply  the  desired  result  with 


c  =  (BbD'l^-'hn  and  c  =  (-~  r)&0- 

27„ 
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3  Details  for  Lemma  1 


We  begin  by  noting  that  it  suffices  to  prove  the  Lemma  in  the  case  Q  =  [0,  l]n. 
To  see  this,  let  Q  be  any  cube  in  Rn  and  let  <j>  be  an  affine  transformation 
mapping  [0,  l]n  onto  Q.  Then  polynomials  p  on  Q  are  related  to  polynomials 
/  on  [0, 1]"  via  the  correspondence 

f(x)  =  p{f>(x)) 

and  the  corresponding  subdivisions  and  discrete  subsets  Y  are  related  analo¬ 
gously.  It  is  clear  that  an  estimate  like  that  given  by  Lemma  1  on  the  size  of 
/  on  [0,  l]n  implies  the  corresponding  estimate  on  the  size  of  p  on  Q.  Hence 
in  what  follows  we  will  always  take  the  cube  Q  to  be  [0, 1]”. 

Our  proof  of  Lemma  1  involves  induction  on  the  dimension  n.  While 
Lemma  1  and  its  proof  are  elementary  and  well  known  in  the  case  n  =  1,  in 
the  first  subsection  we  formulate  it  in  a  manner  convenient  for  the  necessary 
induction  argument.  Since  the  general  case  involves  certain  unpleasant  com¬ 
binatoric  and  geometric  complications,  for  the  sake  of  clarity  we  spell  out 
the  argument  in  the  case  n  —  2  in  the  second  subsection.  The  general  case 
is  considered  in  the  third  subsection. 


3.1  The  case  n=l 

Proposition  1  Lei  T  =  {to, . . . ,  t^}  be  a  subset  of  the  unit  interval  [0,  1]  and 
assume  +  \/q  <  t,,  for  i  —  1, . . . ,  k.Then  for  all  p  €  Pk, 

sup  |p(0l  <  ^Tf-supl p{t)\. 

Proof  Recall  p  =  £jL0  p{t,)L,  where 


£.(<>=  n  ~ 


The  assumption  1  jq  <  t,  —  t,-\  implies  j ti~lj\  <  q/\i  —  j\.  Also,  —tj\  <  1 
for  all  t  €  [0,  1],  Hence,  for  such  l,  |L,(t)j  <  qk/[i\(k  —  i)!]  and 


ElMOls/E:  1 

1=0 


=  (29)! 

fr{  i\(k  -  «)!  k\ 


which  gives  the  desired  inequality. 


Since 


(2?)* 

k! 


<e2q 


this  is  a  simple  variant  of  Lemma  i  in  the  case  n  =  1. 


1 1 


3.2  The  case  n=2 

Proposition  2  Suppose  the  square  Q  =  [0,  l]2  is  divided  into  q2  identical 
subsquares  and  X  is  a  set  that  intersects  each  subsquare.  If  q  >  \2(k  +  1), 
then  for  all  p  €  Pj, 

(21)  sup  |p(z)|  <  e48(*+1)  sup  |p(z)|. 

x€Q  x€-Y 

Proof  Let  Qt,  i  G  /  denote  the  q 2  subsquares  of  Q.  If  q  >  12(k  4-  1)  =  q' 
then  Q  contains  squares  Q'  that  consist  of  exactly  ( q ')2  subsquares  Q,.  Since 

(21)  holds  for  Q  if  it  holds  for  all  such  Q\  we  assume  that  q  =  12(fc  +1). 
Instead  of  (21)  we  will  show  that  if  h  G  Vb  and  \h(x)\  <  1  for  all  x  6  A' 

then 

(22)  sup  1^|  <  e48(fc+1). 

Q 

That  this  implies  (21)  can  be  seen  by  considering  h  =  p/(e  +  supv  |p|),  e  >  0. 

Let  mi  =  mina<3i  \h\  where  d Qi  denotes  the  boundary  of  Q{.  Let  N0  be 
the  number  of  points  in  70  =  {i  €E  /  :  <  1}.  We  assert  that 

(23)  yV0>(12(*  +  l))2-(2*-l)2. 

To  see  this,  take  6  =  (6i,62),  let  y6(z)  =  |h(x)|2  +  (6xXi  4-  62^2)1  and  note 
that  for  every  i  €  I  \I0, 


mino0  <  1  <  (m,)2  =  min  o0- 
Q,  ~  dQ, 

Thus  we  can  choose  e  >  0  so  that  if  |6|  <  e  then  for  every  i  €  /  \  70 

min  gb<  min  gb. 

V»  oQi 

When  this  occurs,  gb  has  a  critical  point  in  the  interior  of  Qi.  Such  a  b  can 
be  chosen  so  that  all  the  critical  points  of  gb  are  nondegenerate,  for  example 
see  Lemma  6.2  on  page  40  of  [7].  Now  gb  €  P2*,  so  by  virtue  of  Proposition  4 
in  Subsection  3.4  it  can  have  at  most  (2k  —  l)2  nondegenerate  critical  points. 
Thus  I  \  Io  has  at  most  (2k  —  l)2  points  and  (23)  follows. 

For  each  i  €  Io  select  a  point  y,  G  dQi  so  that  \h(yi)\  <  1  and  so  that  y,- 
is  not  one  of  the  four  corners  of  Qi.  Partition  Io  into  four  subsets  /x, . . . ,  /4 
according  to  whether  y,  lies  on  the  top,  bottom,  left  or  right  edge  of  Qi.  Let 
be  the  number  of  points  in  I\  and  assume  without  loss  of  generality  that 
N 1  >  Nq/4. 

For  each  j  =  let  I(j)  be  the  set  of  i’s  for  which  Q,  lies  in  the 

horizontal  strip 


{(<,*)  :0<<<  1,(>- !)<?*<;}. 
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Let  N(j)  be  the  number  of  points  in  Ix  fl  I(j)  and  let  N  b->  the  number  of 
points  in  J  —  {j  :  N(j)  >  2(k  +  1)}. 

Note  that  Nt  =  £j=1  N(j)  <  Nq  +  (q  -  N){2k  +  1).  Using  q  =  12{k  +  1), 
this  gives 

Nx  <  N{10k+  11)+  12(*  +  1)(2&  +  1) 

If  N  <  k  then  N0  <  4 Nx  <  4[(10&2  + 11&) +  I2(k  +  \)(2k  +  1)]  which  is  easily 
seen  to  violate  (23).  We  conclude  that  N  >  k  +  1. 

Let  pj(l)  =  h(t,j/q).  In  N(j)  of  the  intervals 

r  —  1  r 

-  <  t  <  ",  r  =  1,...,? 

9  9 

there  is  a  point  i  with  \pj{t)\  <1.  If  j  €  J  there  are  at  least  2(£  +  1)  such 
intervals.  Thus  we  can  apply  Proposition  1  to  pj  and  see  that 

(24)  max  |p,(i)|  <  e2<?  =  e24^+1^ 

for  every  j  €  J .  Using  this  and  the  fact  that  J  has  N  >  k  +  1  points  we  can 
apply  Proposition  1  again,  this  time  to  p(s)  =  h(a,  s ),  a  €  [0, 1]  to  arrive  at 
(22).'  ■ 


3.3  The  general  case 

Proposition  3  Define  7„  for  n  =  1,2,...  by  =  2,  7„  =  2n(l  +  7n-i))  n  > 

1.  Let  A'  be  a  subset  of  Rn ,  let  r  €  (0, 1]  and  let  k  and  q  be  positive  integers 

unth  q  >  7„(1  +  l)/r.  Subdivide  the  unit  n-cube  [0, 1]"  into  qn  identical 
subcubes  and  let  N  be  the  number  of  such  subcubes  that  intersect  X .  If 
N  >  rqn  then  for  all  f  €  Vk 

(25)  sup  |/(i)|  <  sup|/(z)| 

*€[0,l]n  \  /  x€.Y 

Proof  We  first  deal  with  the  case  n  =  1.  In  that  case  the  subcubes  are  the 

intervals  /,•  =  [(i  —  1  )/q,  i/q],  i  —  Let  i(  1)  <  i(2)  <  . . .  <  i(N)  give 

the  intervals  that  intersect  X.  For  each  j  =  1 ,N  choose  x(j )  e  /<(,-)  n  X. 
By  assumption,  N  >  rq  >  2 (k  +  1).  The  points 

=  z(l),  *i  =  x(3),  —  i(l  +  2k) 

satisfy  t}  —  tj- 1  >  l/q  so  (25)  follows  from  Proposition  1. 

To  complete  the  proof  we  use  induction  on  n.  The  integers  k  and  q  will  be 
held  fixed  during  the  induction.  Let  n'  =  n—  1  and  define  r'  by  7 „>/r'  =  7 n/r. 
Then  q  >  7 n>(k  +  1  )/r'.  Subdivide  the  unit  n'-cube  [0, 1]”’  into  qn'  identical 
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subcubes  and  let  N'  be  the  number  of  such  subcubes  that  intersect  X'  C  Rn' . 
If  N'  >  r'qn'  then,  by  induction,  for  all  g  €.Vk 


sup  \g\  < 

[0,1]"' 


(2  g)kY 
kl  i 


sup  |^| 
X 


Instead  of  (25)  we  will  show  that  if  h  €  Vk  and  |ft(z)|  <  1  for  all  x  €  X 
then 

(27)  sup  |*|  < 

[0,1]”  \  k-  ) 

That  this  implies  (25)  can  be  seen  by  considering  h  =  p/(e-f  supv  |p|),  e  >  0. 

Let  Q  denote  the  family  of  qn  subcubes  of  [0, 1]".  For  each  Q  £  Q  let 
mq  =  min^  |/i|  where  dQ  denotes  the  boundary  of  Q.  Let 

Qh  =  {Q  €  Q  :  mQ  <  1},  Qx  =  {Q  €  Q  :  Q  D  X?  0}. 

Note  that  N  is  the  number  of  elements  in  Q  and  let  Nh  be  the  number  of 
elements  in  £?h-We  assert  that 

(28)  Nh>N-  (2  k)n. 

To  see  this,  for  6  €  Rn  consider  the  the  function  gb  defined  by 


gb{x)  =  \h(x)\2  +  (6jii  +  . . .  +  bnxn). 


If  Q  €  Q.\  \  Qh  then 


min  <7o  <  1  <  min  ^0- 
Q  dQ 


Thus  we  can  choose  e  >  0  so  that  for  all  Q  €  Qx  \  Qh  and  all  |6|  <  e 

min  <  min 0b- 
Q  9Q 

When  this  holds,  it  is  evident  that  gb  has  a  critical  point  in  the  interior  of 
Q.  Thus  0b  has  at  least  N  —  i\\  critical  points.  Such  a  b  can  be  chosen  so 
that  the  critical  points  of  gb  are  nondegenerate,  see  Lemma  6.2  on  page  -10  of 
[7],  Since  gb  6  by  virtue  of  Proposition  4  in  Subsection  3.4  it  can  have 
at  most  (2k  —  l)n  nondegenerate  critical  points.  Thus  N  —  Nb  <  (2k  —  l)n 
which  gives  (28). 

For  each  Q  d  Qh  a  point  y(Q )  €  dQ  can  be  selected  so  that  |/i(y(<?))|  <  1. 
By  moving  y(Q)  slightly,  if  necessary,  it  may  also  be  assumed  that  y(Q)  lies 
on  exactly  one  of  the  hyperplanes 


=  {V  €  Rn  :  Vm  =>/?},  m  ~  l,...,n,>  =  O,...,0. 
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Let  Nh(m,j)  be  the  number  of  Q's  for  which  y(Q )  €  Mm,j-  Let  Arfem  = 
Nh(m,j),  and  note  that  Nh  =  A/ *,m  Without  loss  of  generality 

we  assume  JVhn  >  N^/n. 

Let  Y  =  { y(Q )  :  Q  €  <2/» } ,  1  j  =  L  H  Mn<j.  In  each  hyperplane  Mn%j  there 
are  qn~l  (n  —  l)-cubes  that  correspond  to  the  subdivision  of  [0, 1]"  into  qn 
n-cubes.  Let  N(Yj)  be  the  number  of  (n  —  l)-cubes  in  Mnj  that  intersect  Yj. 
Then  N(Yj)  >  iVA(n,  j)/2  because  for  each  (n  —  l)-cube  Q'  in  Mnj  there  are 
at  most  two  n-cubes  Q  €  Q  which  contain  Q'.  Thus  we  have 

(29)  2±N(YJ))>Nh,n>Nh/n. 

j=0 


(30) 


If  N(Yj)  >  r'qn  1  then  from  (26)  we  get 

f(2q)k^-1 


\h(x'J/Q)\  < 


\  *'• 


sup|h|  < 


for  all  x'  €  [0,  l]n_1.  Let  J  =  {j  :  N(Yj)  >  r'g”-1}.  We  will  show  below 
that  J  has  at  least  k  T  1  elements.  This  allows  us  to  apply  Proposition  1  to 
p(t)  =  h(x\i).  The  result  is 

tMx\h(x'J/q)\ 

for  every  t  €  [0, 1].  Because  of  (30),  this  gives  (27). 

Let  s  be  the  number  of  elements  in  J.  It  remains  to  show  s  >  k  -f  1.  For 
all  j,  N (Yj)  <  qn~l  and  for  j  £  J ,  jV(Vj)  <  r'qn~l.  Thus 

E^0>)<»»’"  +  (l  +  »-*)rV‘. 

j=o 

Combining  this  with  (30), (28)  and  the  hypothesis  N  >  rqn  gives 
^-(r9n  -  (2 k)n)  <  £  iV(L})  <  sqn~l  +  (1  +  q  -  s)r'qn~l 

j=0 


or,  after  division  by  qn  1 , 

rq_  _  (2 k)nq 

2  n  qn2n 


(31) 


—  (1  +  q)r'  <  s(l  —  r') 


By  definition  of  r',  r  --  r'-/n/ 7„_[  with  =  2n(l  +  7„_i).  Hence  r/2 n  = 
r'(l  +  7„_i)/7„  or  r/2 n  —  r'  —  r'/7„-i.  Thus  (31)  can  be  rewritten  as 


r'<] 
7n— I 


(^7  +  r') 


By  assumption  we  have  q  >  7n(fc  +  l)/r  =  7„_1(fc+l)/r'.  Taking  A/  =  yn-i/r' 
in  the  following  lemma,  we  find  (1  —  r')(&  +  l)  <  s(l  — r')  which  gives  s  >  k+ 1. 
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Lemma  3  If  n  >  2,  k  >  1,  r'  G  (0,  1],  Mr'  >  2  and  q  >  M(k  +  1)  then 
(32) 


Proof  From  k  <  k  +  1  <  g/Jl/  we  have  £/g  <  l/i\/  <1/2  and 
/2*\n^(*  +  l)  /  2 \2  M(k  +  1)  k+  1  2Jfc 


2n 


*(£)' 


<  — ^  <  —  <  kr'. 
2  n  ~  M  ~  M  ~ 


Multiplying  this  by  —1  and  then  adding  1  +  k  gives 

i+*-*^GH7)"£h(t+i)- 


Hence 


l  +  *-*r'<  --  - 


2k\n  1 


M 


2n . 


which  is  the  same  as  (32). 


3.4  Critical  points  of  polynomials 


Proposition  4  If  p  is  a  real  valued  polynomial  on  Rn  of  degree  d  then  p  can 
have  at  most  (d  —  l)n  nondegenerate  critical  points. 


Proof  A  simple  argument  for  the  case  n  —  2  goes  as  follows:  Let  q  be 
the  greatest  common  factor  of  dp/dxi  and  dp/d i2,  and  write  dp/dx ,  =  qpi , 
i  =  l,2.  If  q  vanishes  at  i0  then 


d2p 

dx{dxj 


(i0)  =  Pj{x  o)^(xo), 


=  0. 


Hence  xo  is  a  degenerate  critical  point.  At  any  nondegenerate  critical  point  zo 
we  therefore  have  pi(z0)  =  0  =  p?(xo)-  Since  p\  and  p2  have  no  common  fac¬ 
tor,  the  two  variable  version  of  Bezout’s  theorem,  for  example  see  [10],  implies 
that  the  number  of  such  points  z0  does  not  exceed  N  =  (degpj)(degp2)  < 
(d—  l)2.  The  lack  of  such  a  convenient  form  of  Bezout’s  theorem  when  n  >  2 
is  what  makes  the  general  case  more  difficult. 

To  obtain  a  proof  in  the  general  case  we  begin  by  observing  that  it  is  a 
corollary  of  its  complex  analogue.  Indeed,  there  is  a  unique  P  €  Vd(Cn)  such 
that  p(x)  =  P(x  +  tO)  for  all  x  G  Rn.  Here  and  in  what  follows  t  =  \f— T. 
From 


dp  dP .  . 

a7(l)  =  dTf* +  ,0) 

and  the  corresponding  formula  for  second  order  partial  derivatives,  it  is  clear 
that  if  zo  is  a  nondegenerate  critical  point  of  p  then  zo  =  *o  +  t0  is  a  nondegen¬ 
erate  critical  point  of  P.  Thus  the  general  case  follows  from  next  proposition. 
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Proposition  5  If  p  G  Vd(Cn)  then  p  can  have  at  most  (d—  l)n  nondegenerate 
critical  points. 

Proof  For  j  =  let  pj  =  dp/dzj.  All  critical  points  of  p  are 

degenerate  if  pj  =  0,  so  we  assume  pj  ^  0  for  all  j.  Let  m  =  dimP^C”);  we 
identify  points  c  G  Cm 

(33)  C  —  (Ca)|a|<<i  “  (^a  ""F  a  lb 

with  points  (a,  6)  G  P2m.  For  z0  G  C,  z  G  C”  and  c  G  Cm  let 

f(z0,z,c)  =  cQ2a4-H. 

|a|<d 

Let  cp  be  the  point  in  Cm  such  that  p(z)  =  f(l,z,cp)  for  all  z  G  C”.  Note 
that  pj(z)  =  /j(  1,  z,  cp)  where  /,  =  d f/dzjt  j  =  1, . . . ,  n. 

Let  z(l), . . . ,  z(N)  be  nondegenerate  critical  points  of  p.  Put  ^  = 
(l,z(r)),  r  =  1  and  observe  that  z  =  A  G  C  is  a  solution  of 

system  fj(z,cp)  =  0,  j  =  By  Bezout’s  Theorem,  [9],  if  n  homoge¬ 

neous  equations  fj( z)  =  0,  in  n  -F  1  variables  z  =  (z0,  z)  have  only  a  finite 
number  of  solution  rays  z  =  r  =  G  Cn+1  \  {0},  then 

q  <  (d  —  1)”  where  d  —  1  is  the  degree  of  j  =  1, . . . ,  n.  The  desired  con¬ 
clusion,  N  <{d—  1)",  would  follow  if  we  knew  that  the  system  /j(z,  cp )  =  0, 
j  =  1, . . . ,  n  had  only  a  finite  number  of  solution  rays.  The  latter  may  not 
be  true,  but  it  suffices  to  show  that  we  can  perturb  cp  to  obtain  a  point 
c  G  Cm  for  which  the  number,  qc,  of  solution  rays  of  the  system  fj( Z,  c)  =  0, 
j  =  1, . . . ,  n  is  finite  and  satisfies  qc  >  N . 

First  we  show  that  qc  >  N  is  automatic  if  c  is  close  enough  to  cp.  Consider 
the  map  T  from  Cn  x  Cm  to  Cn  given  by 

T{z,c )  =  (/i(l,2,c),...,/„(l,z,c)). 

The  points  z(t)  are  nondegenerate,  so  the  n  x  n  matrix  dT/dz  is  nonsingular 
at  (z(z),Cp),  i  =  1  By  the  Implicit  Function  Theorem  there  are 

analytic  functions  Q  on  a  neighborhood  B  C  Cm  of  cp  such  that 

nc.(c),c)  =  0,  Ci(cp)  =  z(i),  i=l,...,N. 

By  making  B  smaller,  if  necessary,  it  may  be  assumed  that  Q(c)  ^  (j(c)  for 
all  c  G  B  and  all  z  ^  j.  It  is  then  evident  that  qe  >  N  for  all  c  G  B. 

To  complete  the  proof  we  will  establish  that  for  almost  every  point  (a,  6)  G 
R7m,  the  system  fj( z,  a  +  ib)  =  0,  j  —  1, . . . ,  n  has  only  a  finite  number  of 
solution  rays.  For  k  =  0, . . . ,  n  define  maps  Jk  from  R2n  to  {z  G  Cn+1  :  z*  =  0 
by 

J  (ii,  •  •  •  >  2n)  —  (i!  “H  zz„, . . . ,  Xh  -I-  zzn^.^,  1,  X)t+\  -F  tzn+>,+i, . . . ,  zn  T  tZ2n)* 
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Let  \  '(k ,  a,  b)  =  p|”=i {x  €  R2n  :  fj{Jk(x),  a  +  ib)  =  0}.  The  maps  Jh  provide 
coordinate  systems  for  complex  projective  n  space.  By  compactness  of  that 
space,  it  suffices  to  prove  that  V(k,  a ,  b)  consists  of  isolated  points  for  every 
k  =  0, . . . ,  n  and  almost  all  (a,  6)  €  R2n . 

The  proof  of  this  uses  a  theorem  from  [7].  To  prepare,  define  /j  Q(z), 
|a|  <  d  by 

2)  =  (*o)J'w§^  =  fk(2o.*.‘0 

and  identify  /jQ  with  an  n  x  m  matrix. 

We  assert  that  fj,a{Jk{x))  has  rank  n  for  every  k  =  0 ,...,n  and  every 
x  €  R2n.  For  <:  /  0  we  take 

a  =  a(i,  k)  =  e(i)  +  (d  —  l)e(&),  i  =  1, . . . ,  n 

where  {e(  1), . . . ,  e(n)}  is  the  standard  basis  for  Rn  and  consider  the  n  x  n 
matrix  Fjti(x,k)  =  fj,a(i,k)( Jk(x))-  Then  F^k{x,k)  =  d,  Fj,j(x,k)  =  1  for 
j  i1  k  and  Fj,  i(x,  k)  =  0  for  j  ^  i,  i  ^  k.  It  follows  that  det(Fj t,(r,  k ))  =  d, 
4^0;  the  off  diagonal  entries  of  the  ktK  column  of  F  are  not  needed  for  this. 
For  k  =  0,  the  n  x  n  matrix  Fjti(x,  0)  =  /j,e(«)(^°(z))  is  seen  to  be  6{j  and 
our  assertion  is  verified. 

To  obtain  notation  more  like  [7]  we  fix  k  €  {0, . . . ,  n}  and  define  real 
valued  functions  U j, . . . ,  f/2n  by 

Uj(xi  ~F  j+n{x ,  a ,  6)  =  fj(J  (i),  a  +  zfc). 

Using  the  analysis  of  Fjti(x,  k )  above,  we  see  that  the  2 n  x  2(n  +  m)  matrix 
of  partial  derivatives  of  Ult . . . ,  (/2n  has  rank  2n.  By  Theorem  7.1  on  page 
50  of  [7]  we  conclude  that  for  almost  all  (a,  6)  €  R2n,  the  2 n  x  2n  matrix 
1^(1,  a,  6)  is  nonsingular  at  every  point  in 

V(k,  a,b)  =  f]  {x  e  R2n  :  <7,(r,  a,  b)  =  0}  . 

1=1 

Thus  for  such  (a,  6)  the  points  in  V(k,a,  b)  are  isolated.  ■ 

4  Miscellaneous  remarks 

Note  that  it  follows  from  Lemma  1  that  Y  is  a  determining  set  for  Vk ■  We 
also  note  that  in  the  cases  where  Y  is  regularly  distributed  in  Q,  for  example 
if  it  consists  of  the  centers  of  each  of  the  subcubes,  then  the  lemma  can  be 
derived  by  more  traditional  methods.  Indeed,  in  the  example  mentioned 
above,  it  is  an  easy  consequence  of  the  results  in  [1]. 

The  analogues  of  Corollary  1  for  Theorems  2  and  3  are  clear.  It  is  also 
clear  that  the  analogues  of  Lemma  1  and  the  Theorems  hold  when  the  cubes 
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are  replaced  by  more  general  parallelepipeds;  simply  apply  an  appropriate 
affine  transformation.  Thus  analogues  of  Corollary  1  hold  when  Q  satisfies  an 
interior  cone  condition.  Since  our  results  seem  to  apply  to  most  reasonable 
situations  we  refrain  from  exploring  further  generalizations. 

If  the  measure  p.  satisfies  (17)  with  r  <  0  then  it  must  have  compact 
support.  Also  recall  that  in  this  case  the  constant  C  can  be  taken  to  be 
independent  of  b0.  Since  the  exponent  c  is  (1  —  r)b0/(2jn),  if  6  is  such  that 
Cb  <  1,  letting  b0  — ►  oo  it  is  clear  that  | f(x)  —  s(z)|  — *  0.  In  other  words, 
for  sufficiently  small  b  if  the  intersec  , on  of  A’  with  any  cube  of  side  b  is  not 
empty  then  s( x)  =  f(x )  on  Rn.  This  means,  of  course,  that  the  values  of 
/  on  A  uniquely  determine  /.  Tne  implications  of  this  to  sampling  theory, 
such  as  that  found  in  [2]  or  [4]  for  example,  will  be  explored  elsewhere. 
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Abstract 

Interpolation  with  linear  combinations  of  translates  of  a  conditionally 
positive  definite  function  A  are  considered.  Such  interpolants  are  solutions 
to  a  constrained  variational  problem  involving  the  data  and  a  quadratic 
form  determined  by  A.  In  the  case  where  the  data  is  the  restriction  of  an 
appropriate  function  /  to  a  subset  X  in  Rn,  pointwise  estimates,  which 
are  valid  in  a  neighborhood  of  A',  are  given  on  the  difference  between  / 
and  its  interpolant. 


1  Introduction 

Suppose  A  is  a  continuous  function  on  Rn  which  is  conditionally  positive  definite 
of  order  m.  The  reader  not  familiar  with  this  notion  should  restrict  his  attention 
to  the  case  m  =  0  and  simply  take  A  to  be  the  Fourier  transform  of  a  positive 
integrable  function,  e.g.  A(x)  =  e_lr^  or  A(x)  =  (1  +  |x|2)“(*+1^2;  the  case 
m  =  1  includes  the  examples  A(x)  =  |x|  and  the  celebrated  multiquadric  A(x)  = 
\/l  +  |x|2.  Such  functions  A  need  not  be  radiaL  Detailed  definitions  can  be 
found  in  Section  2.  More  examples  and  citations  can  be  found  in  Section  4. 

Given  data  (x^,  /,),  j  =  1, . . . ,  A’,  where  the  ff  s  are  scalars  and  the  Xj's  are 
distinct  points  in  /?*,  consider  interpolants  of  the  form 

N 

(1)  5(1)  =  p(x)  +  ^]c^A(x-x>) 

j=i 

where  p  is  a  polynomial  in  Pm~  1,  the  space  of  polynomials  of  degree  <  m  —  1, 
and  Cj's  are  scalars  such  that 

N 

(2)  p(x*)  +  Y^Cjh(x  -  x})  =  fk  ,  k  = 

j=i 

N 

(3)  c}Xj  =  0  for  all  a,  |aj  <  m  —  1  . 

>=» 

We  call  such  interpolants  h-splinea.  We  also  remind  the  reader  that  standard 
multi-index  notation  is  used  in  (3).  For  convenient  reference  in  what  follows  we 
denote  the  set  of  points  {x;}^j  by  the  X,  namely,  X  =  Note  that  in 

‘Partially  supported  by  a  grant  from  the  Air  Force  Office  of  Scientific  Research,  AFOSR- 
89-0524 
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the  case  of  m  =  0  the  polynomial  p  does  not  appear  in  (1);  in  other  words,  in 
this  case  p  =  0  and  (3)  is  vacuous. 

Assuming  that  the  solution  of  (2)-(3)  exists  (it  always  does  in  the  examples 
cited  above)  and  the  values  are  the  restrictions  of  an  appropriate  func¬ 

tion  /  to  the  set  A',  we  are  interested  in  estimates  of  the  error  |/(x)  —  s(x)|  for 
x  in  some  open  neighborhood  fi  of  A".  Of  particular  interest  is  the  asymptotic 
behavior  of  the  estimate  as  the  neighborhood  fl  stays  fixed  and  coverage  by  A’ 
increases. 

It  should  be  clear  that  to  obtain  bounds  on  the  error  certain  restrictions  on 
/  are  necessary.  The  restriction  'onsidered  below  seems  to  be  a  natural  one, 
namely,  /  is  taken  to  be  such  that  the  corresponding  h-spline  s  is  the  “minimum 
norm”  interpolant  of  /  on  A'  in  a  sense  which  is  made  more  precise  in  Section  2. 
Indeed,  in  the  estimates  given  below,  it  is  not  required  that  the  set  A’  be  finite 
or  discrete,  only  that  s  be  the  appropriate  minimum  norm  interpolant  of  /  on 
A'. 


2  Variational  formulation 

In  what  follows  all  integrals  are  taken  over  all  of  72*,  V  is  the  class  of  infinitely 
differentiable  functions  with  compact  support, 

Vm  =  {< i>  :  J  x°  f>{x)dx  =  0  for  all  a,  ja|  <  m)  , 
and  the  Fourier  transform  f  of  an  integrable  function  /  is  defined  by 

/(0  =  J  f(x)e-'«’x>dx. 

All  Fourier  transforms  are  to  be  interpreted  in  the  distributional  sense. 

A  continuous  function  h  on  Rn  is  said  to  be  conditionally  positive  definite  of 
order  m  if  and  only  if 


0) 


J  J  h(x  -  y)d>(x)<j>(y)dxdy  >  0 


for  all  d>  in  Vm.  The  class  CCPDm  is  the  class  of  all  continuous  positive  definite 
functions  on  Rn . 

If  h  is  in  CCPDm  then  h  is  a  tempered  distribution  whose  Fourier  transform 
h  is  the  sum  of  a  positive  Radon  measure  dp.  on  Rm  \  {0}  and  a  distribution 
supported  at  the  origin.  The  measure  is  such  that  /(|f|/(l  +  |f|))2md/x  <  oo 
and  the  distribution  supported  at  the  origin  is  of  the  form  P(D)6  where  6  is 
the  Dirac  measure  at  the  origin  and  P{D)  =  J2-,a-r^  *s  a  partial  differential 
operator  of  order  <  2m  whose  principle  part,  terms  homogeneous  of  degree  2m, 
satisfies 

Y  ac+Pc»c0>  o 

|o|=m  |0|=m 


for  every 
then 


(5) 


choice  of  complex  numbers  {c0}|0|=TO.  In  other  words  if  is  in  Vm 

J  i  h(x  ~  y)^ix^(y)dx dy  ~  J  l^(OI2(^  +  ll^(m)(0)ll24 


where 


\\i{m)m\7A 


Da4>(0)  D^<^(0) 

2^  2w  a°+f>— y— 
|o|=m  |0|=m 
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If  h  is  in  CCPDm  then  the  class  C h  is  defined  as  follows:  /  is  in  Ch  if  and 
only  if  /  is  a  tempered  distribution  and  there  is  a  constant  C  such  that 

(6)  \{f ,  <t>)\  <  C {  J  J  k(x  -  y)<p(x)<p(y)dxdy}1/2 

for  all  <f>  in  Vm.  The  semi-norm  ||/|U  *s  the  infimum  of  all  C’s  which  do  the  job 
in  (6). 

In  view  of  (5)  it  is  clear  that  (6)  is  equivalent  to 

i(/,*>i  <  imu{  J  mtfdn  +  ik(m)(o)ii2^}i/2 , 

from  which  it  is  not  difficult  to  conclude  the  following: 

•  The  null  space  of  the  semi-norm  ||  •  is  Vm-\,  the  class  of  polynomials 
of  degree  <  m  —  1 

•  If  /  is  in  Ch  then  /  =  f\  +  /2  where 

(7)  j\  =  gdy  with  g  in  L2(R',dy) 

and  /2  is  a  polynomial  of  degree  <  m.  Of  course  (7)  means  that  /  \g(£)\2dy 
is  finite  and  for  any  test  function  <j>  we  have  {/i  ,<t>)  =  f 

•  If  d/r  is  absolutely  continuous  with  respect  to  Lebesgue  measure,  namely 
dy  =  u?(£)df ,  and  P(D)  has  no  terms  of  order  2m  then 

^  ii/iu  =  X  ~r 

|o|=m 

and  Ch  can  be  characterized  as  the  class  of  those  distributions  /  for  which 

(8)  is  finite.  Formula  (8)  follows  from  the  fact  that  in  this  case  /i(f)  = 
g(£)w{£)  and  /2  is  a  polynomial  of  degree  <  m-1  so  that  £"/(£)  =  f°/i(f) 
if  |a|  >  m  and 

J\9{()\7dy=  j  = 

=  X  ^  /  ir sttMOi2  (ki a-»(0)‘1«. 

|a|=m  ■  J 

•  If  /  is  in  Ch  then  /  is  continuous  on  R* . 

•  If  v  is  a  finite  measure  with  compact  support  which  annihilates  polyno¬ 
mials  of  degree  <  m  —  1  then  u  *  h  is  in  Ch-  Recall  that 

v  *  h(x)  =  J  h(x  -  y)dw(y) 

and  v  is  said  to  annihilate  Pm-\  if  and  only  if  J  p(x)dv(x)  =  0  for  all  p  in 
Vm-l. 

Suppose  h  is  in  CCPDm ,  f  is  in  C-h,  and  A'  is  a  closed  subset  of  R%  which 
is  unisolvent  for  Pm-i.  (Recall  that  A’  is  said  to  be  unisolvent  for  Vm-i  if  and 
only  if  the  only  polynomial  in  Vm-i  which  vanishes  on  A*  is  the  constant  0.) 
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Then  the  minimal  h  norm  interpolant  of  f  on  A  is  the  unique  element  s  in  Ch 
which  satisfies 

(9)  s(x)  =  f(x)  for  all  x  in  A' 
and 

(10)  IMU  =  minfljjHfc  :  g  6  C*  and  g(x)  =  /(x)  for  all  x  in  A'}  . 

As  a  consequence 

(u)  ll/ll*  =  ll/-*ll*  +  IMI* 

whenever  s  is  a  minimal  h  norm  interpolant  of  /. 

As  mentioned  in  the  introduction  the  main  results  considered  here  concern 
estimates  of  the  error  |/(x)  —  s(x)|  for  x  in  a  neighborhood  fl  of  X.  Such  results 
are  essentially  of  a  local  nature  and  depend  upon,  among  other  things,  how  well 
points  in  fl  are  approximated  by  points  in  A’. 

Suppose 


(12)  the  inequality  (4)  is  strict  for  all  non-zero  <j>  in  Vm 

and  A  =  {xi, . . . ,  Xu  }  is  a  finite  subset  of  Rn  which  is  a  uni-solvent  for  Vm _j. 
In  this  case,  given  any  collection  of  complex  numbers  {/..  there  is  an  / 

in  Ch  such  that  f(x})  =  f},  j  =  1  ...,1V,  and  the  minimal  h  norm  interpolant 
of  f  on  A  is  the  A-spline  defined  by  (l)-(3).  Note  that  (12)  is  implied  by  a  mild 
condition  on  the  support  of  A  and  that  all  the  examples  mentioned  above  enjoy 
(12). 


3  Error  estimates 

On  what  follows  we  always  assume  that  0 

•  h  is  in  CCPDm 

•  /  is  in  C-h 

•  A  is  a  closed  subset  of  Rn 

•  s  is  the  minimal  h  norm  interpolant  of  /  on  A. 

Our  first  estimate  includes  the  quantity  £*(fi,  e)  which  is  defined  by 

Ek(h,c)=  inf  {  sup  jh(x)  —  p(x)|} 

J’67’*  |*|<« 

Theorem  1  Suppose  fl  is  a  cube  of  side  b,  b  >  60  >  0,  and  k  >  m  —  1.  Then 
there  is  a  positive  constant  So  =  S0(bo,k)  such  that  if  every  subcube  of  Cl  of  side 
S  where  S  satisfies  0  <  5  <  50>  contains  a  point  of  X  then 

(13)  |/(x)  -  s(x)|  <  C1||/|U{^(A,  C2S )}''* 

for  all  x  in  Cl  where  C\  and  C?  are  positive  constants  which  are  independent  of 
f  and  5. 

The  proof  of  Theorem  1  can  be  found  in  [6],  The  next  result  requires  further 
hypothesis  on  h.  More  specifically,  if  p.  is  the  Radon  measure  associated  with  h 
as  in  (5)  then  we  will  assume  that  for  sufficiently  large  k 

(H)  J\i\kdp(t)<pkk'k 

where  r  >  0  and  p  >  0  are  fixed  constants. 
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Theorem  2  Suppose  the  Radon  measure  p  associated  with  h  satisfies  (if)  Jot 
some  fixed  constants  r  >  0  and  p  >  0  and  all  integers  k  >  kt,.  Also  suppose  fl 
is  a  cube  of  side  b,  b  >  fe0  >  0-  Then  there  are  positive  constants  5y  =  S„ (be,  h) 
and  A  =  A(6q,  h),  0  <  A  <  1,  suck  that  if  every  subcube  of  fl  of  side  6  where  S 
satisfies  0  <  S  <  Sq,  contains  a  point  of  X  then 

O5)  I/M  ~  *MI  <  ^  *li/IU 

for  all  x  in  fl  where  a  =  min{l,l/r}.  Furthermore,  if  0  <  r  <  1,  there  are 
positive  constants  c  =  c(&o,  h,  r)  and  C  =  C(bo ,  h ,  r)  such  that 

(is)  |/(i)  -  ,(*)|  <  (csy/s\\f\\k 

for  all  x  in  fl  whenever  6  is  sufficiently  small. 

The  proof  of  this  theorem  in  the  general  n  variate  case  is  technically  rather 
complicated.  For  simplicity  and  clarity  we  outline  here  the  complete  argument 
in  the  univariate  case  which  contains  all  the  essential  ingredients. 

Proof  To  see  (15)  without  loss  of  generality  we  may  and  do  assume  that 
r  >  1.  Let  x  be  any  point  in  the  interval  fl,  set  4>  —  f  —  h,  let  5o  be  a  positive 
constant  which  will  be  specified  later,  and  suppose  5  is  a  positive  number  <  £0. 
Suppose  k  is  any  positive  integer  such  that  2 k6  <  b  and  let  flz  be  any  subinterval 
of  fl  which  contains  x  and  has  length  (2k  —  1)5.  Subdivide  flr  into  2Ar  —  1 
subintervals  of  length  5;  namely, 

fl*  -=  Uj=i  [a{,  at+i] 

where  aj  +  i  -  a(  =  5  for  /  =  1, . . . ,  2k  —  1.  Let  x}  be  any  point  in  the  intersection 
of  A'  and  the  subinterval  [a2j_i,a2j],  j  =  1  ,...,k. 

If  p  is  the  polynomial  of  degree  k  -  1  which  interpolates  <t>  on  {xj,...,i*} 
then  Kowalcwski’s  exact  remainder  formula  for  polynomial  interpolation  reads 

(ir)  4>(x)  -  p(x)  =  £/><*)  jf(*>  - 

where  the  Ifi s  are  the  Lagrange  interpolating  polynomials  and  <j>^  denotes  the 
derivative  of  d>  of  order  k ;  see  [l,  page  72]  for  details.  Since  p  =  0,  we  may 
estimate  <t>(x)  by  manipulating  the  right  hand  side  of  (17)  to  obtain 

(is)  w*)i< 

j=i 

where  ||  •  denotes  the  L°°  norm. 

To  estimate  ]]^^l||oo  observe  that  if  k  >  m  +  1  the  polynomial  part  of  d>  is 
annihilated  by  the  derivative  of  order  k.  In  view  of  (7)  for  sufficiently  large  k 
we  may  write 

*(t)(0=^-  e'«‘0 W'mduiQ 

where  rfr  is  in  L7(Rn ,  dp).  Applying  Schwartz’s  inequality  to  the  last  formula 
results  in 

\  ■>  1/2  r  poo  s  1/2 

(19)  SMi)\  {JJMO}  ■ 
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Now 


f  roo  \  i/t  r  [oo  \ 

{ I  £2i<MOj  <(2 p)kk’k  and  |  J  my?drtt)j  =|H|*, 

and,  by  virtue  of  (11),  ||<£||a  <  ||/[|>,  .  Thus  (19)  simplifies  to 


|e'"'ibo  <  — (2p)*A:'*||/|U  • 


(20) 

To  estimate  the  remaining  terms  in  (18)  recall  that 


Ar(x)  =  II  - ~~~  ’  lx  ~  x]\  -  2&6  ,  and  \x}  —  x,  j  >  |  j  —  i\S  . 


xt 


By  virtue  of  these  inequalities  we  may  write 

k 

(21)  £|/7(x)||x,-x|*  <e«*(2tt)‘. 

;=i 

Finally,  substitute  (20)  and  (21)  into  (19)  to  get 

l*(x)l  <  ^e4t(2*S)‘(2p)**lt||/!U. 

This  expression  can  be  simplified  by  using  Stirling’s  approximation  for  k\  and 
writing 

(22)  Jo(x))  < 

where  a  is  a  constant  multiple  of  p  and,  what  is  important,  independent  of  all 
other  parameters  under  consideration. 

Now  comes  the  tricky  part.  Let 

f  f?Y  2  1  ,  r  1 

(  \3J  6„  J  3rjj(m  +  l)r 

Choose  k  so  that  1  <  3 (jcS)'  k  <  2  .  These  choices  imply  that  k  >  m  +  1  and, 
since  r  >  1,  2 6k  <  26kT  <  b,,  so  that  all  the  calculations  leading  to  (22)  are 
valid.  Furthermore,  {at)'irk  <  2/3  and  k  >  l/(3(u,5)1,,r)  so  that  (22)  implies 
(15)  with  X  =  (2/3)lyf'3“  1  .  I  his  completes  the  proof  of  (15). 

To  see  (16)  assume  0  <  r  <  1 ,  let  a  be  the  constant  in  (22),  and  set 


<5 1  =  min 


b. 


I  i  m  +  1 ) 


(l  f  2\’\ 
V  'J  \b„  J  J 


!/(!-») 


Suppose  5  is  a  positive  number  <  b  \  and  take  the  integer  k  so  that  it  satisfies 
fco/2  <  2 tk  <  60.  This  choice  "f  k  ensures  that  k  >  m  +  1  and  2 Sk  <  bo  so  that 
all  the  calculations  leading  to  (22)  are  valid.  Furthermore,  k  <  bo/{2f)  so  that 


mn*  <{«(£)  } 


and  since  the  expression  in  braces  is  <  1  and  k  >  5„/( -If)  we  may  write 

t  f  y  i 


M&rM-GO’} 


£  4 
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Finally,  in  view  of  (22)  and  the  above  inequab'ties,  we  may  conclude  that  if 
0  <  r  <  1  and  0  <  6  <  bj  then  (16)  holds  with 


C  = 


( fr ” 


and  c  = 


(1  ~  T)b0 
4 


We  draw  the  reader’s  attention  to  the  foUowing  points  concerning  these 
theorems: 

•  The  hypotheses  imply  that  A’  is  unisolvent  for  V * 

•  In  view  of  well  known  estimate  on  £*(h,  r),  Theorem  1  implies  that  er¬ 
ror  bound  is  0(£(l+1^2)  as  6  — ►  0  for  arbitrarily  large  k  whenever  h  is 
sufficiently  smooth. 

•  The  cube  17  can  be  replaced  by  more  general  regions.  For  example,  by 
using  affine  transformations  it  should  be  clear  that  17  can  be  taken  to  be 
a  parallelpiped. 

•  Consider  (14)  and  (16)  in  Theorem  2.  In  the  case  r  =  0  the  measure  p  has 
compact  support  and  so  do  the  members  of  C *.  In  this  case  the  constant 
C  in  (16)  can  be  taken  to  be  independent  of  b 0  and  c  can  be  taken  to  be 
&o/4.  Taking  bo  =  oo,  for  example  this  is  the  case  whenever  17  contains 
an  open  cone,  it  follows  that  s(x)  —  f(x).  This  means,  of  course,  that  the 
values  of  f  on  A"  uniquely  determine  /. 

•  The  estimate  needed  for  n-variate  polynomials  to  replace  the  estimate  on 
53  |^(x)|  in  the  above  argument  may  be  found  in  [8).  The  remaining 
ingredients  needed  for  the  proof  in  the  general  n-variate  case  can  be  found 
in  [7], 


4  Miscellaneous  examples  and  remarks 

Observe  that  the  examples  mentioned  in  the  introduction  h(x)  =  e“l*l  ,  (1  + 
|x|2)-i»  +  i)/2,  anc|  _ yrr  |x|2  are  very  smooth  so  that  Ek{h,t)  =  0(<:t  +  1)  for 
any  non-negative  integer  k  in  all  these  cases.  The  first  example  satisfies  (14) 
with  r  =  1/2  and  the  other  two  satisfy  (14)  with  r  =  1.  For  more  examples  of 
this  type  see  [7]. 

The  the  function  h  defined  by 


h{x) 


=  n^ 
2  =  1 


where  are  the  coordinates  of  x,  in  other  words  x  —  (ij,...,xR),  is 

conditionally  positive  definite  of  order  zero  and  satisfies  (14)  with  r  =  0  and 

p  =  \A- 

The  other  example  mentioned  in  the  introduction  h(x)  =  — |x|  is  not  so 
smooth  and  fails  to  satisfy  (14).  More  generally  we  may  consider  h(x)  =  |x|“, 
a  >  0,  which  fail  to  satisfy  (14)  but  enjoy  E*(/i,t)  =  C(t“)  as  c  — *  0  whenever 
k  >  a  —  1.  Thus  Theorem  1  implies  error  bounds  which  are  0{ba^)  as  8  — >  0 
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for  these  examples.  Such  error  bounds  are  not  optimal.  In  the  case  n  =  1 
and  a  =  1  or  3  the  corresponding  int.erpolants  are  piecewise  linear  or  piecewise 
cubic  splines  respectively;  the  optimal  pointwise  error  estimates  are  known  to 
be  0(6°+1)  as  5  —  0  in  these  instances,  see  [l  1  j.  In  the  case  n  =  3  and  a  =  1 
Nelson  [4]  has  shown  that  a  pointwise  error  bound  is  possible  which  is  0(64)  as 
6-0. 

The  variational  theory  of  /i-splines  introduced  in  [5]  may  be  regarded  as 
an  extension  of  [2].  The  motivation  for  this  extension  was  a  question  implic¬ 
itly  raised  in  [3]  concerning  the  invertibility  of  (atJ)  =  ( y/l  +  \xi~— Xy|)  which 
this  theory  settled.  An  alternate,  apparently  more  appealing,  solution  to  this 
question  is  Micchelli’s  generalization  [9]  of  Schoenberg’s  theorem  [10]  concern¬ 
ing  positive  definite  functions;  this  extension  inspired  many  related  publications 
which,  because  of  space  limitations,  cannot  be  cited  here,  however,  see  the  re¬ 
lated  articles  by  Buhmann,  Powell,  Schaback,  and  Ward  in  these  proceedings. 
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